neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 | | Oct 18 2024 | |
neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit | | Oct 15 2024 | |
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8 | | Oct 20 2024 | |
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8 | | Oct 19 2024 | |
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 | | Oct 19 2024 | |
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 | | Oct 19 2024 | |
neuralmagic/Mistral-Nemo-Instruct-2407-quantized.w4a16 | | Oct 19 2024 | |
neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic | | Oct 21 2024 | |
neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV | | Oct 19 2024 | |
neuralmagic/Meta-Llama-3-8B-Instruct-FP8 | | Oct 19 2024 | |
neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic | | Oct 19 2024 | |
neuralmagic/Mistral-Nemo-Instruct-2407-FP8 | | Oct 20 2024 | |
neuralmagic/Meta-Llama-3-70B-Instruct-FP8 | | Oct 20 2024 | |
neuralmagic/DeepSeek-Coder-V2-Lite-Instruct-FP8 | | Oct 20 2024 | |
neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin | | Oct 21 2024 | |
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 | | Oct 17 2024 | |
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8-dynamic | | Oct 20 2024 | |
neuralmagic/Phi-3-medium-128k-instruct-quantized.w4a16 | | Oct 19 2024 | |
neuralmagic/Llama-3.2-3B-Instruct-FP8 | | Oct 17 2024 | |
neuralmagic/gemma-2-9b-it-FP8 | | Oct 19 2024 | |