Founder | Agentic AI...ย โขย 26d
All LLMs are LMs, but not all LMs are LLMs. Most people still get confused. I've explained below. โข ๐๐ ๐ (๐๐ฎ๐ป๐ด๐๐ฎ๐ด๐ฒ ๐ ๐ผ๐ฑ๐ฒ๐น๐): These are models that can process and generate human language. They can be small or medium-sized and may not require huge datasets. โข ๐๐๐ ๐ (๐๐ฎ๐ฟ๐ด๐ฒ ๐๐ฎ๐ป๐ด๐๐ฎ๐ด๐ฒ ๐ ๐ผ๐ฑ๐ฒ๐น๐): These are a ๐๐ฝ๐ฒ๐ฐ๐ถ๐ณ๐ถ๐ฐ ๐๐๐ฝ๐ฒ of LM, but much ๐น๐ฎ๐ฟ๐ด๐ฒ๐ฟ in scale. LLMs like GPT-3 or GPT-4 are trained on massive datasets, have billions (or even trillions) of parameters. An ๐๐๐ is a ๐๐๐ฏ๐๐ฒ๐ ๐ผ๐ณ ๐๐ that is: โข Very large in size โข Trained on massive datasets โข Based on deep neural networks (Transformers) โข Capable of reasoning, coding, summarizing, etc. Types of LM's: ๐๐ ๐ฆ๐ถ๐๐ฒ / ๐ฆ๐ฐ๐ฎ๐น๐ฒ 1. ๐ฆ๐บ๐ฎ๐น๐น ๐๐ ๐ โข Lightweight, fast, low-cost models with limited intelligence. 2. ๐ ๐ฒ๐ฑ๐ถ๐๐บ ๐๐ ๐ โข Balanced speed and accuracy, suitable for most production systems. 3. ๐๐ฎ๐ฟ๐ด๐ฒ ๐๐ ๐ โข High-capacity models with strong reasoning, powerful but expensive. ______________ ๐๐ ๐จ๐๐ฎ๐ด๐ฒ 1. ๐๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐น-๐ฝ๐๐ฟ๐ฝ๐ผ๐๐ฒ ๐๐ ๐ โข Designed to handle many tasks โข Chat, writing, coding, reasoning 2. ๐๐ผ๐บ๐ฎ๐ถ๐ป-๐๐ฝ๐ฒ๐ฐ๐ถ๐ณ๐ถ๐ฐ ๐๐ ๐ โข Trained or tuned for one field โข Legal, finance, medical, etc. โข More accurate in narrow domains 3. ๐๐ฑ๐ด๐ฒ ๐๐ ๐ โข Run locally on devices โข Privacy-friendly โข Limited power due to size ______________ ๐๐ ๐ง๐ฟ๐ฎ๐ถ๐ป๐ถ๐ป๐ด ๐ฆ๐๐๐น๐ฒ 1. ๐ฃ๐ฟ๐ฒ-๐๐ฟ๐ฎ๐ถ๐ป๐ฒ๐ฑ โข Trained on general internet-scale data โข Base intelligence layer 2. ๐๐ถ๐ป๐ฒ-๐ง๐๐ป๐ฒ๐ฑ โข Adapted for specific tasks or domains โข Improves accuracy and usefulness 3. ๐๐ป๐๐๐ฟ๐๐ฐ๐๐ถ๐ผ๐ป-๐๐๐ป๐ฒ๐ฑ โข Optimized to follow user instructions โข This is what ChatGPT-style models are Most people just know about LLM's but it's important to know such fundamentals. โ Repost for others so they can also know this fundamental difference.

AI Deep Explorer | f...ย โขย 9m
LLM Post-Training: A Deep Dive into Reasoning LLMs This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs) focusing on improving reasoning capabilities. While LLMs achieve strong performance
See MoreAI Deep Explorer | f...ย โขย 10m
"A Survey on Post-Training of Large Language Models" This paper systematically categorizes post-training into five major paradigms: 1. Fine-Tuning 2. Alignment 3. Reasoning Enhancement 4. Efficiency Optimization 5. Integration & Adaptation 1๏ธโฃ Fin
See More

Hey I am on Medialย โขย 1y
Huge announcement from Meta. Welcome Llama 3.1๐ฅ This is all you need to know about it: The new models: - The Meta Llama 3.1 family of multilingual large language models (LLMs) is a collection of pre-trained and instruction-tuned generative models
See More
ย โขย
YouTubeย โขย 1y
Researchers at Google DeepMind introduced Semantica, an image-conditioned diffusion model capable of generating images based on the semantics of a conditioning image. The paper explores adapting image generative models to different datasets. Instea
See MoreFounder | Agentic AI...ย โขย 2m
SLM vs LLM โ which AI model is best for you? Iโve explained both in simple steps below. ๐ฆ๐๐ (๐ฆ๐บ๐ฎ๐น๐น ๐๐ฎ๐ป๐ด๐๐ฎ๐ด๐ฒ ๐ ๐ผ๐ฑ๐ฒ๐น) (๐ด๐ต๐ฆ๐ฑ-๐ฃ๐บ-๐ด๐ต๐ฆ๐ฑ) Lightweight AI models built for speed, focus, and on-device execution. 1. ๐๐ฒ๐ณ๐ถ๐ป๐ฒ
See More
Hey I am on Medialย โขย 1y
Weekly AI Roundup : Cost Efficient Models, Advances in Robotics & Cutting edge AI tools OpenAI Unveils GPT-4o Mini: OpenAI's GPT-4o mini is a cost-efficient model aimed at expanding AI accessibility, offering a significant price reduction compared
See More
Hey I am on Medialย โขย 11m
Indiaโs healthcare sector largely relies on foreign datasets for AI-driven medical research and development. This dependency arises due to the lack of a centralized, large-scale, and high-quality indigenous healthcare dataset. Most AI models in healt
See MoreFounder | Agentic AI...ย โขย 9d
Most people overlook these basics of AI Agents. I've explained it in a very simple way below. 1. ๐๐ ๐๐ด๐ฒ๐ป๐ An AI system that observes its environment, information, makes decisions, and takes actions to achieve a goal. 2. ๐๐๐ ๐ (๐๐ฎ๐ฟ๐ด๐ฒ
See More
AI Deep Explorer | f...ย โขย 9m
Want to learn AI the right way in 2025? Donโt just take courses. Donโt just build toy projects. Look at whatโs actually being used in the real world. The most practical way to really learn AI today is to follow the models that are shaping the indus
See MoreDownload the medial app to read full posts, comements and news.