AI Deep Explorer | f...Ā ā¢Ā 2m
The best way to learn about LLMs is to read the actual papers that highlight the fundamental ideas behinds LLMs. I'd prob first start off by learning about the attention mechanism which can be detailed in the following paper and try to implement a vanilla transformer: https://lnkd.in/eV6NcXx8 Once you have that down I would probably aim to learn about earlier models and try to implement them. Here are a couple models that are good examples to try to implement from scratch (Karpathy and Umar has good videos for them if you get stuck or just want general overview): ābert: https://lnkd.in/eU7F324a āgpt: https://lnkd.in/eAzaDsP5 āgpt2: https://lnkd.in/ehjhXveV After you gone through all of these, you should have a decent foundation for the very basics and start reading papers in the areas you are interested in. If you don't know where to start these are some suggestions for papers I think that are interesting and worth considering reading: āScaling laws: https://lnkd.in/gdWbBH8i āLora: https://lnkd.in/eW-V4Dcq āMixture of experts: https://lnkd.in/eM5ngGSj āReinforcement Learning from Human Feedback : https://rlhfbook.com/
AI Deep Explorer | f...Ā ā¢Ā 2m
Day 1/100 : FREE AI Resource Sharing Topic of Day: History Of Artificial Intelligence(AI) Books ā³"Artificial Intelligence: A Modern Approach" by Stuart Russell and Peter Norvig https://lnkd.in/gzSCYnf9 ā³ "The Master Algorithm: How the Quest for t
See MoreDownload the medial app to read full posts, comements and news.