🚀 Medial Secures Investment on Shark Tank India - Fueling the Future of Professional Social Networking. 🔥

News

Messages

Try our Valuation Calculator →

Back

AI Engineer

AI Deep Explorer | f... • 10m

Give me 2 minutes, I will tell you How to Learn Reinforcement Learning for LLMs A humorous analogy for reinforcement learning uses cake as an example. Reinforcement learning, much like baking a cake, involves trial and error to achieve a desired outcome (a delicious cake) by learning from rewards (delicious cake) and penalties (burnt cake). Unsupervised learning is the foundation (the cake itself), supervised learning adds the frosting, and reinforcement learning is the cherry on top, the final touch. ⇛ Most important paper for LLM Reinforcement Learning - Asynchronous Deep Reinforcement Learning (Google Deepmind 2016) https://lnkd.in/gQUK3xmb - Reinforcement Learning from Human (OpenAI 2017) https://lnkd.in/gf5iPfhJ -Proximal Policy Optimization (OpenAI 2017) https://lnkd.in/gAG6As-7 -Fine-Tuning Language Models from Human Preferences (OpenAI 2020) https://lnkd.in/gsfxReUg -Learning to Summarize from Human Feedback (OpenAI 2022) https://lnkd.in/grUG-XHU -Direct Preference Optimization( Stanford University 2023) https://lnkd.in/gTKSQnCN - Group Relative Policy Optimization ( DeepSeek 2024) https://lnkd.in/gkNRn5sh -reinforcement learning with verifiable rewards (DeepSeek 2025) https://lnkd.in/gcksvi-v ⫸ Books for Reinforcement Learning -Reinforcement Learning from Human Feedback (Nathan Lambert) https://lnkd.in/gJW4JmiS -Reinforcement Learning: Industrial Applications (Phil Winder) https://amzn.to/4iufoQz -Reinforcement Learning (Richard S. Sutton) https://amzn.to/4jf0SNv Keep exploring, keep growing, and always give back!

Recommendations from Medial

AI Engineer

AI Deep Explorer | f... • 10m

Having worked on Reinforcement Learning, it’s always fascinating to see how it’s being applied in the world of LLMs. If you’re curious about how RL powers modern LLM agents, especially in areas like reward modeling, and policy gradients here are a f

AI Engineer

AI Deep Explorer | f... • 11m

The best way to learn about LLMs is to read the actual papers that highlight the fundamental ideas behinds LLMs. I'd prob first start off by learning about the attention mechanism which can be detailed in the following paper and try to implement a

1 Reply

AI Engineer

AI Deep Explorer | f... • 11m

My Favorite AI & ML Books That Shaped My Learning Over the years, I’ve read tons of books in AI, ML, and LLMs — but these are the ones that stuck with me the most. Each book on this list taught me something new about building, scaling, and underst

1 Reply

AI Engineer

AI Deep Explorer | f... • 10m

The ultimate AI/ML roadmap for beginners 👇 𝗠𝗮𝘁𝗵𝘀 What to learn: • Linear Algebra • Calculus • Statistics Resources: • Practical Statistics for Data Science( https://amzn.to/446czl5 ) • Mathematics for Machine Learning( https://amzn.to/441s

Amit Sharma

Lead Data Analyst @ ... • 7m

🚨 AI is replacing jobs at a scary speed. If you’re not learning it, you’re falling behind. Here are 10 YouTube channels that will make you AI-savvy fast: 1. 🎓 DeepLearning AI – Andrew Ng’s practical courses 🔗 https://lnkd.in/gucHYZrq 2. ⚡ Two

AI Engineer

AI Deep Explorer | f... • 11m

𝗪𝗮𝗻𝘁 𝘁𝗼 𝗯𝗲𝗰𝗼𝗺𝗲 𝗮𝗻 𝗔𝗜 𝗺𝗮𝘀𝘁𝗲𝗿 𝗶𝗻 2025? Learn AI from the ground up with these 𝗙𝗥𝗘𝗘 YouTube channels that make it all crystal clear Let’s be honest—YouTube can teach you more about AI than a lot of university degrees, From

AI Engineer

AI Deep Explorer | f... • 10m

Old is Gold: Deep Learning Classics In the fast-paced world of AI, it’s easy to overlook the timeless gems that laid the foundation for modern deep learning. Here’s a curated list of classic, high-quality courses taught by pioneers of the field tha

4 Replies

Comet

#freelancer • 1y

Mastering LinkedLists: Key Questions You Should Know Easy: 📌 Reverse Linked List: https://lnkd.in/g7qP9-YU 📌 Merge Two Sorted Lists: https://lnkd.in/gRfC6yyF 📌 Remove Nth Node From End of List: https://lnkd.in/gGnGF75X 📌 Delete Node in a Linked

AI Engineer

AI Deep Explorer | f... • 11m

If I were learning RAG from scratch in 2025... here's exactly how I'd do it. But here I bring you the ultimate resource list for absolutely no cost. RAG is one of the most practical, production-ready LLM patterns today. Here's All you need to get s