I have a GTX 1660 Super (6 GB).
Right now I have Ollama with:
- deepseek-r1:8b
- qwen2.5-coder:7b
Do you recommend any other local models to play with on my GPU?
DeepSeek is good at reasoning and Qwen is good at programming, but I find llama3.1 8b well suited for creativity, writing, translation, and other tasks that fall outside the scope of your two models. It’s a decent all-rounder, and it’s about 4.9 GB in q4_K_M.
That’s not out of my scope; I’m just learning what I can do locally with my current machine.
Today I read about RAG; maybe I’ll try an easy local setup to chat with a PDF.
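From what I’ve read, a bare-bones version doesn’t need much beyond what I already have: something to pull text out of the PDF, Ollama’s embeddings endpoint for retrieval, and one of my chat models for the answer. Here’s the kind of thing I have in mind (untested sketch; the model names, chunk size, and file name are just placeholders I made up, not anything I’ve benchmarked):

```python
# Minimal local RAG sketch: pypdf for text extraction, Ollama's /api/embeddings
# endpoint for retrieval, and a chat model for the final answer.
# Assumes Ollama is running locally and both models have already been pulled.
import requests
import numpy as np
from pypdf import PdfReader

OLLAMA = "http://localhost:11434"
EMBED_MODEL = "nomic-embed-text"   # placeholder embedding model
CHAT_MODEL = "qwen2.5-coder:7b"    # any chat model you have pulled

def embed(text: str) -> np.ndarray:
    r = requests.post(f"{OLLAMA}/api/embeddings",
                      json={"model": EMBED_MODEL, "prompt": text})
    return np.array(r.json()["embedding"])

# 1. Read the PDF and split it into rough, fixed-size chunks.
reader = PdfReader("document.pdf")
full_text = "\n".join(page.extract_text() or "" for page in reader.pages)
chunks = [full_text[i:i + 1000] for i in range(0, len(full_text), 1000)]

# 2. Embed every chunk once (a real setup would cache these in a vector store).
chunk_vecs = np.stack([embed(c) for c in chunks])

# 3. For a question, pick the most similar chunks by cosine similarity.
question = "What is the main conclusion of this document?"
q = embed(question)
sims = chunk_vecs @ q / (np.linalg.norm(chunk_vecs, axis=1) * np.linalg.norm(q))
top = [chunks[i] for i in np.argsort(sims)[-3:]]

# 4. Ask the chat model, passing the retrieved chunks as context.
prompt = ("Answer using only this context:\n\n" + "\n---\n".join(top)
          + f"\n\nQuestion: {question}")
r = requests.post(f"{OLLAMA}/api/chat",
                  json={"model": CHAT_MODEL, "stream": False,
                        "messages": [{"role": "user", "content": prompt}]})
print(r.json()["message"]["content"])
```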
Mistral
I personally run models on my laptop. I have 48 GB of RAM and an i5-12500U. It runs a little slow, but it’s usable.
My gear is an old i7-4790 with 16 GB RAM.
How many tokens per second do you get?
The biggest bottleneck is going to be memory bandwidth. I would just stick with GPU-only inference, since your GPU’s memory has far more bandwidth than system RAM.
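Rough numbers to show why (the bandwidth figures below are ballpark assumptions, not measured on your machine):

```python
# Back-of-envelope check of why a 7-8B q4 model fits on a 6 GB card, and why
# spilling layers to system RAM hurts. All numbers are rough assumptions.
params = 8e9                 # ~8B parameter model
bits_per_weight = 4.5        # q4_K_M averages a bit over 4 bits per weight
weights_gb = params * bits_per_weight / 8 / 1e9
print(f"weights: ~{weights_gb:.1f} GB")   # ~4.5 GB, leaves headroom for the KV cache

# Token generation is roughly bandwidth-bound: each token reads all weights once.
gpu_bw = 336   # GB/s, GTX 1660 Super spec sheet figure
ram_bw = 25    # GB/s, assumed effective dual-channel DDR3/DDR4 throughput
print(f"GPU-only upper bound: ~{gpu_bw / weights_gb:.0f} tok/s")
print(f"CPU/RAM upper bound:  ~{ram_bw / weights_gb:.0f} tok/s")
```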