User contributions for S232zhen
12 April 2025
- 02:06, 12 April 2025 (−1) stat946W25 →Summary and Future Directions
- 02:06, 12 April 2025 (+2,473) stat946W25 →Graph-RAG for Document Set Summarization
- 01:43, 12 April 2025 (−1,769) stat946W25 →Topic 18: Retrival Augmented Generation (RAG)
- 01:30, 12 April 2025 (−59) stat946W25 →Summary and Future Directions
- 01:27, 12 April 2025 (−24) stat946W25 →Topic 18: Retrival Augmented Generation (RAG)
- 00:39, 12 April 2025 (+1,016) stat946W25 →Topic 19: MM-LLMs
- 00:31, 12 April 2025 (+567) stat946W25 →Summary Table of Multimodal LLMs
- 00:29, 12 April 2025 (+1,533) stat946W25 →Topic 19: MM-LLMs
3 April 2025
- 09:58, 3 April 2025 (+2,000) stat946W25 →Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
- 09:38, 3 April 2025 (+580) stat946W25 →Background
- 09:30, 3 April 2025 (+242) stat946W25 →Background
- 09:11, 3 April 2025 (+8) stat946W25 →Topic 6: KV Cache Compression
2 April 2025
- 20:54, 2 April 2025 (+3,924) stat946W25 →Compact Language Models via Pruning and Knowledge Distillation
- 14:19, 2 April 2025 (+34) stat946W25 →Compact Language Models via Pruning and Knowledge Distillation
- 14:11, 2 April 2025 (−39) stat946W25 →Compact Language Models via Pruning and Knowledge Distillation
- 13:56, 2 April 2025 (+26) stat946W25 →Topic 5: KD / Pruning / Sharing
13 March 2025
- 11:42, 13 March 2025 (+1) stat946W25 →Taylor Linear Attention
- 11:41, 13 March 2025 (+1,580) stat946W25 →Simple linear attention language models balance the recall-throughput tradeoff
- 11:27, 13 March 2025 (+492) stat946W25 →Simple linear attention language models balance the recall-throughput tradeoff
- 11:10, 13 March 2025 (−49) stat946W25 →Topic 10: Linear Attention
12 March 2025
- 01:40, 12 March 2025 (+365) stat946W25 →H3 Design
- 01:33, 12 March 2025 (+1,212) stat946W25 →State Space Duality (SSD)
- 01:14, 12 March 2025 (+655) stat946W25 →Topic 12: State Space Models