Another open-source model has arrived, and itâs even better than DeepSeek-V3.
The Allen Institute for AI just introduced TĂźlu 3 (405B) đŤ, a post-training model that is a fine-tune of Llama 3.1 405B, which outperforms DeepSeek V3.
Anonymous 3
Hey I am on Medial â˘Â 2m
But if people are fine tuning their models on top of the existing models then toh its not a good thing na? Like imagine training on AI output...thought that was recursion or something.
0 replies
More like this
Recommendations from Medial
Jainil Prajapati
Turning dreams into ... â˘Â 1m
India should focus on fine-tuning existing AI models and building applications rather than investing heavily in foundational models or AI chips, says Groq CEO Jonathan Ross.
Is this the right strategy for India to lead in AI innovation? Thoughts?
2 replies3 likes
Varun reddy
 â˘Â
GITAM â˘Â 10m
Fine-Tuning: The Secret Sauce of AI Magic!
Ever wonder how AI gets so smart? Itâs all about fine-tuning!
Imagine a pre-trained model as a genius with general knowledge. đ§ ⨠Fine-tuning takes that genius and hones its skills for a specific task, li
See More
1 replies4 likes
Bhoop singh Gurjar
AI Deep Explorer | f... â˘Â 2d
"A Survey on Post-Training of Large Language Models"
This paper systematically categorizes post-training into five major paradigms:
1. Fine-Tuning
2. Alignment
3. Reasoning Enhancement
4. Efficiency Optimization
5. Integration & Adaptation
1ď¸âŁ Fin
Working on a cool AI project, that involves vector db and LLM fine-tuning
0 replies2 likes
Ayush Maurya
AI Pioneer â˘Â 3m
BREAKTHROUGH INSIGHT:
Most people train AI models.
Smart people fine-tune AI models.
But the real secret?
Learning to dance with AI's existing knowledge.
Stop forcing. Start flowing.
0 replies3 likes
Yogesh Jamdade
..... â˘Â 9m
Hey everyone, I'm an engineering student geeking out over Generative AI. Loving LangChain, Hugging Face models, Crew.ai's chatbots, fine-tuning, and RAG. Plus, machine learning and data science are pretty cool too!
Anyone else into this stuff? Looki
See More
0 replies3 likes
Yogesh Jamdade
..... â˘Â 9m
Hey everyone, I'm an engineering student geeking out over Generative AI. Loving LangChain, Hugging Face models, Crew.ai's chatbots, fine-tuning, and RAG. Plus, machine learning and data science are pretty cool too!
Anyone else into this stuff? Looki
đĄAn Idea to Change the Game for AI Startups: Making AI Processing Faster, Cheaper, and Effortless
Running AI models like ChatGPT, DALL¡E, or AlphaCode is a computing monsterâthey need massive power to function, which makes them expensive to operate
See More
2 replies4 likes
Zaki Aslam
Dope â˘Â 23d
Out of all the startups and professionals here, how many are actively working on developing their own AI models? And how many are working on developing apps, tools or services based on existing AI models
"Just fine-tuned LLaMA 3.2 using Apple's MLX framework and it was a breeze! The speed and simplicity were unmatched. Here's the LoRA command I used to kick off training:
```
python lora.py \
--train \
--model 'mistralai/Mistral-7B-Instruct-v0.2' \
-