AI Deep Explorer | f...ย โขย 2m
Want to learn AI the right way in 2025? Donโt just take courses. Donโt just build toy projects. Look at whatโs actually being used in the real world. The most practical way to really learn AI today is to follow the models that are shaping the industry โ and read the technical papers that power them. Thatโs where you see what works in practice, not just theory. Here's a curated list of the most impactful language models technical paper: 1๏ธโฃGPT Series (OpenAI) GPT-1 โ Improving Language Understanding by Generative Pre-Training(2018) https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf GPT-2 โ Language Models are Unsupervised Multitask Learners (2019) https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf GPT-3 โ Language Models are Few-Shot Learners(2020) https://arxiv.org/abs/2005.14165 ChatGPT :Trained with RLHF โ Reinforcement Learning from Human Feedback (Ouyang et al., 2022) https://arxiv.org/abs/2203.02155 GPT-4 โ GPT-4 Technical Report (2023) https://cdn.openai.com/papers/gpt-4.pdf 2๏ธโฃClaude (Anthropic) Constitutional AI: Harmlessness from AI Feedback (2022) https://arxiv.org/pdf/2212.08073 3๏ธโฃGemini (Google DeepMind) Gemini: A Family of Highly Capable Multimodal Models (2023) https://arxiv.org/abs/2312.11805 Start building with Gemini 2.5 Flash(2025) https://developers.googleblog.com/en/start-building-with-gemini-25-flash/ 4๏ธโฃGemma (Google) Gemma: Open Models for Responsible AI(2024) https://arxiv.org/abs/2403.08295 Gemma 3 Technical Report(2025) https://arxiv.org/abs/2503.19786 5๏ธโฃLLaMA Series (Meta AI) LLaMA: Open and Efficient Foundation Language Models(2023) https://arxiv.org/abs/2302.13971 LLaMA 2: Improved training and safety (2023) https://arxiv.org/pdf/2307.09288 Llama 3:The Llama 3 Herd of Models https://arxiv.org/abs/2407.21783 Llama 4:The beginning of a new era of natively multimodal AI innovation https://ai.meta.com/blog/llama-4-multimodal-intelligence/ 6๏ธโฃMistral AI(France) Mistral 7B: Grouped-query attention (2023) https://arxiv.org/abs/2310.06825 7๏ธโฃKimi by Moonshot AI (China) Scaling RL with LLMs: Technical Report of Kimi k1.5 (2025) https://arxiv.org/abs/2501.12599 8๏ธโฃDeepSeek(China) DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models https://arxiv.org/abs/2402.03300 DeepSeek-V3 Technical Report (2024) https://arxiv.org/pdf/2412.19437 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 9๏ธโฃQwen (China) Qwen Technical Report(2023) https://arxiv.org/pdf/2309.16609 Qwen2 Technical Report(2024) https://arxiv.org/pdf/2407.10671 Qwen2.5 Technical Report(2024) https://arxiv.org/pdf/2412.15115 Qwen2.5-Omni Technical Report Multimodel (2025) https://arxiv.org/pdf/2503.20215 Keep exploring, keep growing, and always give back!
startups, technology...ย โขย 11m
Meta has introduced the Llama 3.1 series of large language models (LLMs), featuring a top-tier model with 405 billion parameters, as well as smaller variants with 70 billion and 8 billion parameters. Meta claims that Llama 3.1 matches the performance
See MoreHey I am on Medialย โขย 11m
Huge announcement from Meta. Welcome Llama 3.1๐ฅ This is all you need to know about it: The new models: - The Meta Llama 3.1 family of multilingual large language models (LLMs) is a collection of pre-trained and instruction-tuned generative models
See Moreย โขย
OpenAIย โขย 5m
AI TOOLS TO CHECKOUT ๐ฅ Pickle: Your AI body double for video calls. https://getpickle.ai/ ๐ฆ Prompt Llama: Gather high-quality text-to-image prompts and test the performance of different models with the same prompts. https://prompt-llama.com/ ๐ค
See MorePython Developer ๐ป ...ย โขย 4m
3B LLM outperforms 405B LLM ๐คฏ Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 ๐คฏ ๐คฏ LLM: llama 3 Datasets: MATH-500 & AIME-2024 This has done on research with compute optimal Test-Time Scaling (TTS). Recently, OpenAI o1 shows that Test-
See MoreHey I am on Medialย โขย 1y
Good Morning Everyone ๐, Do you think that, Elon Musk's Grok Model is going to be big competition for models like GPT-4,Claude Models Sonet/Optus, Google's Gemini Model or Grok is lagging behind the competition? According to me, making a Model publ
See MoreDownload the medial app to read full posts, comements and news.