User contributions for Aelmancy
Jump to navigation
Jump to search
10 April 2025
- 22:3122:31, 10 April 2025 diff hist +436 stat946W25 →Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
- 03:2603:26, 10 April 2025 diff hist +250 stat946W25 →Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
- 01:5701:57, 10 April 2025 diff hist +102 stat946W25 →Topic 18: Retrival Augmented Generation (RAG)
3 April 2025
- 07:1807:18, 3 April 2025 diff hist 0 stat946W25 →Stage Two: Learning the Prior
- 07:1807:18, 3 April 2025 diff hist 0 stat946W25 →Generation
- 07:1007:10, 3 April 2025 diff hist +122 stat946W25 →Topic 19: MM-LLMs
- 07:0607:06, 3 April 2025 diff hist +379 stat946W25 →Zero-Shot Text-to-Image Generation
- 07:0007:00, 3 April 2025 diff hist 0 N File:dalle 5.png No edit summary current
- 06:4606:46, 3 April 2025 diff hist 0 stat946W25 →Zero-Shot Text-to-Image Generation
- 06:4506:45, 3 April 2025 diff hist +624 stat946W25 →Zero-Shot Text-to-Image Generation
- 06:4106:41, 3 April 2025 diff hist +89 N File:dalle 4.webp source: https://medium.com/@zaiinn440/how-openais-dall-e-works-da24ac6c12fa current
- 06:1106:11, 3 April 2025 diff hist +25 stat946W25 →Zero-Shot Text-to-Image Generation
- 05:0005:00, 3 April 2025 diff hist +209 stat946W25 →Zero-Shot Text-to-Image Generation
- 04:5104:51, 3 April 2025 diff hist +89 N File:dalle 3.webp source: https://medium.com/@zaiinn440/how-openais-dall-e-works-da24ac6c12fa current
- 04:5104:51, 3 April 2025 diff hist +89 N File:dalle 2.webp source: https://medium.com/@zaiinn440/how-openais-dall-e-works-da24ac6c12fa current
- 04:4304:43, 3 April 2025 diff hist +61 stat946W25 →Zero-Shot Text-to-Image Generation
- 04:4004:40, 3 April 2025 diff hist +89 N File:dalle 1.webp source: https://medium.com/@zaiinn440/how-openais-dall-e-works-da24ac6c12fa current
- 04:3804:38, 3 April 2025 diff hist +877 stat946W25 →Zero-Shot Text-to-Image Generation
- 01:1401:14, 3 April 2025 diff hist +81 stat946W25 →Zero-Shot Text-to-Image Generation
- 01:0801:08, 3 April 2025 diff hist +856 stat946W25 →Topic 19: MM-LLMs
25 March 2025
- 22:3022:30, 25 March 2025 diff hist +264 stat946W25 →Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
- 21:5421:54, 25 March 2025 diff hist +59 stat946W25 →Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
- 21:5121:51, 25 March 2025 diff hist 0 N File:dynamic pruning 1.png No edit summary current
- 21:5021:50, 25 March 2025 diff hist +128 stat946W25 →Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers
- 21:4421:44, 25 March 2025 diff hist +1,711 stat946W25 →Topic 6: KV Cache Compression
20 March 2025
- 21:4021:40, 20 March 2025 diff hist +177 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 21:3821:38, 20 March 2025 diff hist 0 N File:echoatt results3.png No edit summary current
- 21:3721:37, 20 March 2025 diff hist −1 stat946W25 →Results
- 21:3721:37, 20 March 2025 diff hist +3 stat946W25 No edit summary
- 21:3421:34, 20 March 2025 diff hist +1 stat946W25 →Results
- 21:3321:33, 20 March 2025 diff hist +4 stat946W25 →Results
- 21:3221:32, 20 March 2025 diff hist +468 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 21:2521:25, 20 March 2025 diff hist 0 N File:echoatt results2.png No edit summary current
- 21:2221:22, 20 March 2025 diff hist 0 N File:echoatt results1.png No edit summary current
19 March 2025
- 20:5720:57, 19 March 2025 diff hist +384 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 20:3720:37, 19 March 2025 diff hist +17 stat946W25 →Distillation with teacher's Pseudo-Labels
- 20:3620:36, 19 March 2025 diff hist −1 stat946W25 →Knowledge Distillation
- 20:3620:36, 19 March 2025 diff hist +250 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 20:3420:34, 19 March 2025 diff hist +1,935 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
18 March 2025
- 20:4320:43, 18 March 2025 diff hist +994 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 19:5619:56, 18 March 2025 diff hist +801 stat946W25 →EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
- 19:4019:40, 18 March 2025 diff hist +310 stat946W25 →Topic 5: KD / Pruning / Sharing
- 19:3219:32, 18 March 2025 diff hist +55 N File:echoatt.png https://doi.org/10.48550/arXiv.2409.14595 current
- 19:3019:30, 18 March 2025 diff hist +1,888 stat946W25 →Topic 5: KD / Pruning / Sharing
13 March 2025
- 22:1322:13, 13 March 2025 diff hist +21 stat946W25 →Summary & Key Takeaways
- 22:0822:08, 13 March 2025 diff hist +94 stat946W25 →Summary & Key Takeaways
- 22:0622:06, 13 March 2025 diff hist 0 stat946W25 →Topic 12: State Space Models
- 22:0422:04, 13 March 2025 diff hist +1,227 stat946W25 →Topic 12: State Space Models
- 18:2118:21, 13 March 2025 diff hist 0 stat946W25 →Key Approaches to Linear Attention
- 18:2118:21, 13 March 2025 diff hist +10 stat946W25 →Retentive Network (RetNet): A Successor to Transformer for Large Language Models