AI Engineer

AI Deep Explorer | f... • 2m

Give me 2 minutes and I'll tell you how to learn reinforcement learning for LLMs.

A humorous analogy for reinforcement learning uses cake. Reinforcement learning, much like baking a cake, involves trial and error to reach a desired outcome by learning from rewards (a delicious cake) and penalties (a burnt cake). In the wider picture, unsupervised learning is the foundation (the cake itself), supervised learning adds the frosting, and reinforcement learning is the cherry on top, the final touch.

⇛ Most important papers for LLM reinforcement learning
- Asynchronous Methods for Deep Reinforcement Learning (Google DeepMind, 2016) https://lnkd.in/gQUK3xmb
- Deep Reinforcement Learning from Human Preferences (OpenAI, 2017) https://lnkd.in/gf5iPfhJ
- Proximal Policy Optimization Algorithms (OpenAI, 2017) https://lnkd.in/gAG6As-7
- Fine-Tuning Language Models from Human Preferences (OpenAI, 2019) https://lnkd.in/gsfxReUg
- Learning to Summarize from Human Feedback (OpenAI, 2020) https://lnkd.in/grUG-XHU
- Direct Preference Optimization (Stanford University, 2023) https://lnkd.in/gTKSQnCN
- Group Relative Policy Optimization (DeepSeek, 2024) https://lnkd.in/gkNRn5sh
- Reinforcement Learning with Verifiable Rewards (DeepSeek, 2025) https://lnkd.in/gcksvi-v

⫸ Books for reinforcement learning
- Reinforcement Learning from Human Feedback (Nathan Lambert) https://lnkd.in/gJW4JmiS
- Reinforcement Learning: Industrial Applications of Intelligent Agents (Phil Winder) https://amzn.to/4iufoQz
- Reinforcement Learning: An Introduction (Richard S. Sutton and Andrew G. Barto) https://amzn.to/4jf0SNv

Keep exploring, keep growing, and always give back!
