User contributions for Pmcwhann
Jump to navigation
Jump to search
17 November 2020
- 19:0219:02, 17 November 2020 diff hist −556 Pre-Training Tasks For Embedding-Based Large-Scale Retrieval No edit summary
- 18:5418:54, 17 November 2020 diff hist +18 stat940F21 →Paper presentation
- 18:5418:54, 17 November 2020 diff hist 0 stat940F21 →Paper presentation
- 18:5318:53, 17 November 2020 diff hist +1,921 N Pre-Training Tasks For Embedding-Based Large-Scale Retrieval Created page with "==Introduction== One of the ways humans learn language, especially second language or language learning by students, is by communication and getting its feedback. However, mo..."
- 18:5218:52, 17 November 2020 diff hist +2,068 N Pre-Training-Tasks-For-Embedding-Based-Large-Scale-Retrieval Created page with "This page is a summary for NIPS 2016 paper <i>Dialog-based Language Learning</i> [1]. ==Introduction== One of the ways humans learn language, especially second language or la..." current
- 18:5118:51, 17 November 2020 diff hist +107 stat940F21 →Paper presentation
16 November 2020
- 00:1300:13, 16 November 2020 diff hist +2 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
- 00:1300:13, 16 November 2020 diff hist +68 orthogonal gradient descent for continual learning →References
- 00:1200:12, 16 November 2020 diff hist +1,481 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
15 November 2020
- 23:3423:34, 15 November 2020 diff hist +300 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
13 November 2020
- 03:1603:16, 13 November 2020 diff hist +191 a fair comparison of graph neural networks for graph classification →Risk Assessment and Model Selection
- 03:1303:13, 13 November 2020 diff hist +177 a fair comparison of graph neural networks for graph classification →Graph basics
- 03:1003:10, 13 November 2020 diff hist +1,115 a fair comparison of graph neural networks for graph classification →Graph basics
- 02:4102:41, 13 November 2020 diff hist +442 a fair comparison of graph neural networks for graph classification →Graph Neural Networks
- 02:1602:16, 13 November 2020 diff hist +148 The Curious Case of Degeneration →Distributional Statistical Evaluation
- 02:1502:15, 13 November 2020 diff hist −3 The Curious Case of Degeneration →Perplexity
- 02:1502:15, 13 November 2020 diff hist 0 The Curious Case of Degeneration →Introduction
- 02:1502:15, 13 November 2020 diff hist +91 The Curious Case of Degeneration →Perplexity
- 02:1402:14, 13 November 2020 diff hist +76 The Curious Case of Degeneration →Introduction
- 02:1302:13, 13 November 2020 diff hist +65 The Curious Case of Degeneration →Introduction
- 02:1102:11, 13 November 2020 diff hist −19 The Curious Case of Degeneration →Language Model Decoding
- 02:1002:10, 13 November 2020 diff hist −4 The Curious Case of Degeneration →Language Model Decoding
- 02:0802:08, 13 November 2020 diff hist +13 The Curious Case of Degeneration →Top-k Sampling
- 02:0802:08, 13 November 2020 diff hist +13 The Curious Case of Degeneration →Sampling with Temperature
- 02:0702:07, 13 November 2020 diff hist +19 The Curious Case of Degeneration →Sampling with Temperature
- 02:0502:05, 13 November 2020 diff hist −1 The Curious Case of Degeneration →Conclusion
- 02:0502:05, 13 November 2020 diff hist −1 The Curious Case of Degeneration →Conclusion
- 02:0402:04, 13 November 2020 diff hist +6 The Curious Case of Degeneration →What is Perplexity?
- 02:0302:03, 13 November 2020 diff hist +1 The Curious Case of Degeneration →Top-k Sampling
- 02:0202:02, 13 November 2020 diff hist 0 The Curious Case of Degeneration →Top-k Sampling
- 01:5901:59, 13 November 2020 diff hist +664 The Curious Case of Degeneration →What is Perplexity?
- 01:4201:42, 13 November 2020 diff hist +624 The Curious Case of Degeneration →What is Perplexity?
- 01:3101:31, 13 November 2020 diff hist +59 The Curious Case of Degeneration →References
- 01:3001:30, 13 November 2020 diff hist +1 The Curious Case of Degeneration →What is Perplexity?
- 01:2901:29, 13 November 2020 diff hist +120 The Curious Case of Degeneration →Perplexity
8 November 2020
- 21:0421:04, 8 November 2020 diff hist +2 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 20:5820:58, 8 November 2020 diff hist −5 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 20:5720:57, 8 November 2020 diff hist 0 N File:SOP.PNG No edit summary current
- 20:5620:56, 8 November 2020 diff hist −37 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 20:5520:55, 8 November 2020 diff hist +18 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 20:5520:55, 8 November 2020 diff hist +1,114 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
3 November 2020
- 22:5922:59, 3 November 2020 diff hist +529 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Factorized embedding parameterization
- 14:3814:38, 3 November 2020 diff hist 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →conclusion
- 14:3714:37, 3 November 2020 diff hist 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 14:3614:36, 3 November 2020 diff hist 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
2 November 2020
- 19:1319:13, 2 November 2020 diff hist −1 Learning The Difference That Makes A Difference With Counterfactually-Augmented Data →Experiments
- 19:1219:12, 2 November 2020 diff hist −2 Learning The Difference That Makes A Difference With Counterfactually-Augmented Data →Experiments
- 17:4117:41, 2 November 2020 diff hist −4 GradientLess Descent →GradientLess Descent Algorithm
- 17:3817:38, 2 November 2020 diff hist +40 GradientLess Descent →GradientLess Descent Algorithm
- 17:3617:36, 2 November 2020 diff hist 0 GradientLess Descent →GradientLess Descent Algorithm