What should I use: big model-small quant or small model-no quant?

Smorty [she/her]@lemmy.blahaj.zone · edit-2 5 days ago

What should I use: big model-small quant or small model-no quant?

j4k3@lemmy.world · edit-2 12 days ago

deleted by creator

Smorty [she/her]@lemmy.blahaj.zone · 12 days ago

Another user @SGforce@lemmy.ca commented about there being a way to split it between GPU and CPU. Are you talking about this nvidia only and windows only thingy, which only works with the proprietary driver? If so, I’m really not gonna use that…

Have you tried some of the abliterated models? They work really nicely even for the spiciest of topics. They literally can’t refuse your instruction, so they just go ahead and do what you want. But maybe even these models are too narrow for your specific application…

j4k3@lemmy.world · 12 days ago

deleted by creator