I took a math practice test and would like to have it graded by an LLM since I can’t find the answer key online. I have 20GB VRAM, but I’m on Intel Arc so I can’t do gemma3. I would prefer models from ollama.com 'cause I’m not deep enough down the rabbit hole to try Hugging Face stuff yet and don’t have time to right now.
If you were down to use Hugging Face, DeepHermes is a reasoning model built on top of Mistral Small 24B. It’d fit decently well in 20GB.
Maybe the
ollama run hf.co/{username}/{repository}
command would make it easy enough for you. Reasoning models are usually better for math.
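
If you'd rather not paste the whole test into the terminal, here's a rough sketch of sending it to Ollama's local HTTP API from Python instead. It assumes Ollama is running on its default port (11434) and that you've already pulled the model. The function names and the prompt wording are just my own, and the sample problem is made up, so adjust as needed.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_grading_request(model, problems_and_answers):
    """Build the JSON payload asking the model to grade each answer.

    problems_and_answers is a list of (problem_text, your_answer) pairs.
    """
    lines = [
        "Grade this math practice test. For each problem, work out the "
        "correct answer step by step, then say whether my answer is right.",
        "",
    ]
    for number, (problem, answer) in enumerate(problems_and_answers, start=1):
        lines.append(f"Problem {number}: {problem}")
        lines.append(f"My answer: {answer}")
        lines.append("")
    # "stream": False asks Ollama for one complete JSON reply.
    return {"model": model, "prompt": "\n".join(lines), "stream": False}


def grade(payload, url=OLLAMA_URL):
    """Send the payload to a running Ollama server and return its text reply."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as reply:
        return json.loads(reply.read())["response"]


# Placeholder model name copied from the command above: fill in the real
# username/repository before calling grade().
payload = build_grading_request(
    "hf.co/{username}/{repository}",
    [("Solve 2x + 3 = 11 for x.", "x = 4")],  # made-up sample problem
)
print(payload["prompt"])
```

Once the model name is filled in and Ollama is up, `print(grade(payload))` should give you the model's grading. I'd still spot-check a few problems by hand.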