Back

More like this

Recommendations from Medial

Image Description

Varun reddy

 • 

GITAM • 1y

Fine-Tuning: The Secret Sauce of AI Magic! Ever wonder how AI gets so smart? It’s all about fine-tuning! Imagine a pre-trained model as a genius with general knowledge. 🧠✨ Fine-tuning takes that genius and hones its skills for a specific task, li

See More
1 Reply
4

Kimiko

Startups | AI | info... • 1m

X updates its developer agreement to ban third parties from using the X API or X Content for training or fine-tuning foundation or frontier AI models.

Reply
12

AI Engineer

AI Deep Explorer | f... • 3m

"A Survey on Post-Training of Large Language Models" This paper systematically categorizes post-training into five major paradigms: 1. Fine-Tuning 2. Alignment 3. Reasoning Enhancement 4. Efficiency Optimization 5. Integration & Adaptation 1️⃣ Fin

See More
Reply
1
8

Swami Gadila

Founder of Friday AI • 27d

Testing our Voice Clone Model in Friday — built just last night! Fingers crossed it performs well at the bench targets today. Let’s make this moment count.❤

Reply
5

Aditya Karnam

Hey I am on Medial • 4m

"Just fine-tuned LLaMA 3.2 using Apple's MLX framework and it was a breeze! The speed and simplicity were unmatched. Here's the LoRA command I used to kick off training: ``` python lora.py \ --train \ --model 'mistralai/Mistral-7B-Instruct-v0.2' \ -

See More
Reply
1

Gigaversity

Gigaversity.in • 9d

Overfitting, underfitting, and fitting — these aren't just technical terms, but critical checkpoints in every machine learning workflow. Understanding these concepts is key to evaluating model behavior, improving generalization, and building solutio

See More
Reply
4
Image Description
Image Description

Chamarti Sreekar

Passionate about Pos... • 5m

Another open-source model has arrived, and it’s even better than DeepSeek-V3. The Allen Institute for AI just introduced Tülu 3 (405B) 🐫, a post-training model that is a fine-tune of Llama 3.1 405B, which outperforms DeepSeek V3.

10 Replies
14
29

AI Engineer

AI Deep Explorer | f... • 3m

LLM Post-Training: A Deep Dive into Reasoning LLMs This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs) focusing on improving reasoning capabilities. While LLMs achieve strong performance

See More
Reply
2

Comet

#freelancer • 7m

10 Best LLM Tools to Simplify Your Workflow 1️⃣ LangChain LangChain's flexibility suits complex AI and multi-stage processing. 2️⃣ Cohere Cohere offers integration, scalability, and customization. 3️⃣ Falcon Falcon offers cost-effective, high-per

See More
Reply
1
2

Download the medial app to read full posts, comements and news.