Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
GabrielBianconi
6 months ago
|
parent
|
context
|
favorite
| on:
Fine-tuned small LLMs can beat large ones with pro...
With supervised fine-tuning (SFT), you'll often see good results with 100-1000+ datapoints (they can be variations of the same prompt template). If you have more limited data, reinforcement fine-tuning (RFT) can work well in the 10-100 range.
Good luck!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
Good luck!