User contributions for Z6zhen
Jump to navigation
Jump to search
6 April 2025
- 05:0305:03, 6 April 2025 diff hist 0 stat946W25 →Evaluation and Results
- 05:0205:02, 6 April 2025 diff hist +1 stat946W25 →Evaluation and Results
- 04:5804:58, 6 April 2025 diff hist +47 stat946W25 →Evaluation and Results
- 04:5404:54, 6 April 2025 diff hist +95 stat946W25 →Evaluation and Results
- 04:5204:52, 6 April 2025 diff hist +70 N File:Diffusion-LM Improves Controlable Text Generation result.png Diffusion-LM Improves Controlable Text Generation result current
31 March 2025
- 04:5804:58, 31 March 2025 diff hist +923 stat946W25 →Topic 7: Dynamic Models: Many-in-One Language Models
- 04:1504:15, 31 March 2025 diff hist +490 stat946W25 →FLEXTRON: Many-in-One Flexible Large Language Model
- 04:1304:13, 31 March 2025 diff hist −531 stat946W25 →Flextron: Many-in-One Flexible Large Language Model
- 03:5003:50, 31 March 2025 diff hist +6 stat946W25 →Introducing Optimal Brain Quantization
- 03:4903:49, 31 March 2025 diff hist +85 stat946W25 →Method
- 03:4903:49, 31 March 2025 diff hist −72 stat946W25 →GPTQ: An Improved OBQ
- 03:4803:48, 31 March 2025 diff hist +14 stat946W25 →GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
- 03:4803:48, 31 March 2025 diff hist −18 stat946W25 →Introducing Optimal Brain Quantization
- 03:4703:47, 31 March 2025 diff hist −2 stat946W25 →GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
- 03:4603:46, 31 March 2025 diff hist +49 stat946W25 →Introducing Optimal Brain Quantization
- 03:4503:45, 31 March 2025 diff hist −11 stat946W25 →Introducing Optimal Brain Quantization
- 03:4303:43, 31 March 2025 diff hist −4 stat946W25 →GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
- 03:4003:40, 31 March 2025 diff hist +9 stat946W25 →GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
- 03:3403:34, 31 March 2025 diff hist +191 stat946W25 →Topic 4: Quantization
- 03:2703:27, 31 March 2025 diff hist +24 stat946W25 →Topic 4: Quantization
- 03:2403:24, 31 March 2025 diff hist −49 stat946W25 →Topic 12: State Space Models
- 03:2103:21, 31 March 2025 diff hist −6 stat946W25 →Topic 12: State Space Models
- 03:1703:17, 31 March 2025 diff hist +849 stat946W25 →Introduction
- 02:5602:56, 31 March 2025 diff hist +51 stat946W25 →Hardware-Efficient Long Convolutions
- 02:4402:44, 31 March 2025 diff hist +44 stat946W25 →Flash Fast Fourier Transform Convolution (FlashFFT Conv)
- 02:4102:41, 31 March 2025 diff hist +2 stat946W25 →Flash Attention V1
- 02:4002:40, 31 March 2025 diff hist −6 stat946W25 →Flash Attention V1
- 02:3402:34, 31 March 2025 diff hist +741 stat946W25 →Results
- 02:1902:19, 31 March 2025 diff hist +347 stat946W25 →Results
11 March 2025
- 02:0102:01, 11 March 2025 diff hist +1,361 stat946W25 →Topic 12: State Space Models