User contributions for K4liang
Jump to navigation
Jump to search
10 April 2025
- 20:1120:11, 10 April 2025 diff hist +78 stat946W25 →Simplicity and Efficiency
- 20:1120:11, 10 April 2025 diff hist 0 N File:Average numbers of API calls.jpg No edit summary current
- 13:2013:20, 10 April 2025 diff hist +351 stat946W25 →Key Contributions
- 13:1713:17, 10 April 2025 diff hist +1,372 stat946W25 →Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
- 12:5512:55, 10 April 2025 diff hist 0 N File:ITER-RETGEN iterates retrieval and generation.jpg No edit summary current
- 12:5212:52, 10 April 2025 diff hist +1,323 stat946W25 →Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
26 March 2025
- 16:5516:55, 26 March 2025 diff hist +2,191 stat946W25 →Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
19 March 2025
- 14:4314:43, 19 March 2025 diff hist +279 stat946W25 →Zero-shot Tasks
- 14:3914:39, 19 March 2025 diff hist 0 N File:Mean zero-shot accuracy.png No edit summary current
- 14:3914:39, 19 March 2025 diff hist +49 stat946W25 →Empirical Result
- 14:3514:35, 19 March 2025 diff hist +431 stat946W25 →Empirical Result
- 14:3114:31, 19 March 2025 diff hist 0 File:results on WikiText2.png K4liang uploaded a new version of File:results on WikiText2.png current
- 14:2914:29, 19 March 2025 diff hist +71 stat946W25 →Empirical Result
- 14:2714:27, 19 March 2025 diff hist 0 N File:results on WikiText2.png No edit summary
- 14:2514:25, 19 March 2025 diff hist +73 stat946W25 →SliceGPT
- 14:1614:16, 19 March 2025 diff hist +902 stat946W25 →SliceGPT
- 13:4713:47, 19 March 2025 diff hist +568 stat946W25 →SliceGPT
- 13:4113:41, 19 March 2025 diff hist 0 N File:QwithRMSNorm.png No edit summary current
- 13:4013:40, 19 March 2025 diff hist +174 stat946W25 →SliceGPT
- 13:3813:38, 19 March 2025 diff hist 0 N File:rmsnorm.png No edit summary current
- 13:3513:35, 19 March 2025 diff hist +191 stat946W25 →SliceGPT
- 13:2113:21, 19 March 2025 diff hist −1 stat946W25 →Computational invariance
- 13:2013:20, 19 March 2025 diff hist +1,656 stat946W25 →Transformer networks
- 12:4612:46, 19 March 2025 diff hist +1 stat946W25 →Computational invariance
17 March 2025
- 20:1420:14, 17 March 2025 diff hist +65 stat946W25 →Computational invariance
- 20:0920:09, 17 March 2025 diff hist 0 N File:invariance Theorem.png No edit summary current
- 20:0820:08, 17 March 2025 diff hist +1,925 stat946W25 →Attention Is All You Need But You Don’t Need All Of It For Inference of Large Language Models
13 March 2025
- 13:2313:23, 13 March 2025 diff hist +38 stat946W25 No edit summary
12 March 2025
- 21:4621:46, 12 March 2025 diff hist +1 stat946W25 →TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer
- 21:3821:38, 12 March 2025 diff hist −18 stat946W25 →Key Contributions
- 21:3521:35, 12 March 2025 diff hist −31 stat946W25 →Key Contributions
- 21:3221:32, 12 March 2025 diff hist 0 File:RobustAlgorithm.jpg K4liang uploaded a new version of File:RobustAlgorithm.jpg current
- 21:2921:29, 12 March 2025 diff hist +65 stat946W25 →Key Contributions
- 21:2321:23, 12 March 2025 diff hist 0 N File:RobustAlgorithm.jpg No edit summary
- 21:2121:21, 12 March 2025 diff hist +34 stat946W25 →Key Contributions
- 16:5116:51, 12 March 2025 diff hist +1 stat946W25 →Theorem 2
- 16:4916:49, 12 March 2025 diff hist +33 stat946W25 →Theorem 2
- 16:4516:45, 12 March 2025 diff hist +911 stat946W25 →Core concepts
- 16:3316:33, 12 March 2025 diff hist +1 stat946W25 →Theorem 2
- 16:2616:26, 12 March 2025 diff hist +1,327 stat946W25 →Key Contributions
- 15:5515:55, 12 March 2025 diff hist −323 stat946W25 →Structured State Space (S4)
- 15:2315:23, 12 March 2025 diff hist +861 stat946W25 →Structured State Space (S4)
- 13:3913:39, 12 March 2025 diff hist +759 stat946W25 →Core concepts
11 March 2025
- 20:5320:53, 11 March 2025 diff hist +1 stat946W25 →TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer
- 20:5120:51, 11 March 2025 diff hist +186 stat946W25 →Key Contributions
- 20:4820:48, 11 March 2025 diff hist +1,065 stat946W25 →Key Contributions
- 20:2620:26, 11 March 2025 diff hist 0 stat946W25 →TRANSNORMERLLM: A Faster and Better Large Language Model with Improved Transformer
- 20:2520:25, 11 March 2025 diff hist +702 stat946W25 →TRANSNORMERLLM: A Faster and Better Large Language Model with Improved Transformer
- 14:2414:24, 11 March 2025 diff hist +23 stat946W25 →Key Contributions
- 14:2114:21, 11 March 2025 diff hist +1,186 stat946W25 →Key Approaches to Linear Attention