There are some options out there, depending on what type of task you're trying to fine-tune for. RL fine-tuning in the style of DeepSeek isn't well developed yet, I think, but you can fine-tune a small Llama model (~3B params) for classification or extraction tasks and it works really well. What sort of tasks were you looking at fine-tuning for?