User contributions for Pmcwhann
17 November 2020
- 20:02, 17 November 2020 −556 Pre-Training Tasks For Embedding-Based Large-Scale Retrieval No edit summary
- 19:54, 17 November 2020 +18 stat940F21 →Paper presentation
- 19:54, 17 November 2020 0 stat940F21 →Paper presentation
- 19:53, 17 November 2020 +1,921 N Pre-Training Tasks For Embedding-Based Large-Scale Retrieval Created page with "==Introduction== One of the ways humans learn language, especially second language or language learning by students, is by communication and getting its feedback. However, mo..."
- 19:52, 17 November 2020 +2,068 N Pre-Training-Tasks-For-Embedding-Based-Large-Scale-Retrieval Created page with "This page is a summary for NIPS 2016 paper <i>Dialog-based Language Learning</i> [1]. ==Introduction== One of the ways humans learn language, especially second language or la..." current
- 19:51, 17 November 2020 +107 stat940F21 →Paper presentation
16 November 2020
- 01:13, 16 November 2020 +2 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
- 01:13, 16 November 2020 +68 orthogonal gradient descent for continual learning →References
- 01:12, 16 November 2020 +1,481 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
- 00:34, 16 November 2020 +300 orthogonal gradient descent for continual learning →Orthogonal Gradient Descent
13 November 2020
- 04:16, 13 November 2020 +191 a fair comparison of graph neural networks for graph classification →Risk Assessment and Model Selection
- 04:13, 13 November 2020 +177 a fair comparison of graph neural networks for graph classification →Graph basics
- 04:10, 13 November 2020 +1,115 a fair comparison of graph neural networks for graph classification →Graph basics
- 03:41, 13 November 2020 +442 a fair comparison of graph neural networks for graph classification →Graph Neural Networks
- 03:16, 13 November 2020 +148 The Curious Case of Degeneration →Distributional Statistical Evaluation
- 03:15, 13 November 2020 −3 The Curious Case of Degeneration →Perplexity
- 03:15, 13 November 2020 0 The Curious Case of Degeneration →Introduction
- 03:15, 13 November 2020 +91 The Curious Case of Degeneration →Perplexity
- 03:14, 13 November 2020 +76 The Curious Case of Degeneration →Introduction
- 03:13, 13 November 2020 +65 The Curious Case of Degeneration →Introduction
- 03:11, 13 November 2020 −19 The Curious Case of Degeneration →Language Model Decoding
- 03:10, 13 November 2020 −4 The Curious Case of Degeneration →Language Model Decoding
- 03:08, 13 November 2020 +13 The Curious Case of Degeneration →Top-k Sampling
- 03:08, 13 November 2020 +13 The Curious Case of Degeneration →Sampling with Temperature
- 03:07, 13 November 2020 +19 The Curious Case of Degeneration →Sampling with Temperature
- 03:05, 13 November 2020 −1 The Curious Case of Degeneration →Conclusion
- 03:05, 13 November 2020 −1 The Curious Case of Degeneration →Conclusion
- 03:04, 13 November 2020 +6 The Curious Case of Degeneration →What is Perplexity?
- 03:03, 13 November 2020 +1 The Curious Case of Degeneration →Top-k Sampling
- 03:02, 13 November 2020 0 The Curious Case of Degeneration →Top-k Sampling
- 02:59, 13 November 2020 +664 The Curious Case of Degeneration →What is Perplexity?
- 02:42, 13 November 2020 +624 The Curious Case of Degeneration →What is Perplexity?
- 02:31, 13 November 2020 +59 The Curious Case of Degeneration →References
- 02:30, 13 November 2020 +1 The Curious Case of Degeneration →What is Perplexity?
- 02:29, 13 November 2020 +120 The Curious Case of Degeneration →Perplexity
8 November 2020
- 22:04, 8 November 2020 +2 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 21:58, 8 November 2020 −5 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 21:57, 8 November 2020 0 N File:SOP.PNG No edit summary current
- 21:56, 8 November 2020 −37 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 21:55, 8 November 2020 +18 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 21:55, 8 November 2020 +1,114 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
3 November 2020
- 23:59, 3 November 2020 +529 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Factorized embedding parameterization
- 15:38, 3 November 2020 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →conclusion
- 15:37, 3 November 2020 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
- 15:36, 3 November 2020 0 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations →Inter-sentence coherence loss
2 November 2020
- 20:13, 2 November 2020 −1 Learning The Difference That Makes A Difference With Counterfactually-Augmented Data →Experiments
- 20:12, 2 November 2020 −2 Learning The Difference That Makes A Difference With Counterfactually-Augmented Data →Experiments
- 18:41, 2 November 2020 −4 GradientLess Descent →GradientLess Descent Algorithm
- 18:38, 2 November 2020 +40 GradientLess Descent →GradientLess Descent Algorithm
- 18:36, 2 November 2020 0 GradientLess Descent →GradientLess Descent Algorithm