Founder | Agentic AI... • 2m
3 transformer architectures everyone should know. I've explained them in a simple way below.
1. Decoder-Only Models
These are mainly used for text generation (like ChatGPT). They predict the next token step by step.
• Tokenization: Break input text into smaller tokens
• Positional encoding: Add position info so the model understands word order
• Self-attention: Each token looks at previous tokens for context
• Query/Key/Value: Mechanism used to measure token relationships
• Causal attention: Tokens can only see earlier tokens, not future ones
• Feedforward layer: Neural layer refines token representations
• Context update: Model refreshes its internal understanding
• Next token prediction: Predict the most probable next token
• Autoregressive generation: Repeat prediction until the full response is formed
• Output sequence: Generated text becomes the final result
___________
2. Encoder-Only Models
These models focus on understanding text rather than generating it, for tasks like classification, embeddings, and search.
• Tokenization: Convert text into tokens
• Embedding layer: Transform tokens into numerical vectors
• Positional encoding: Add sequence order information
• Self-attention: Each token attends to every other token
• Multi-head attention: Capture multiple relationships simultaneously
• Layer normalization: Stabilize values during processing
• Context encoding: Build a deep contextual representation
• Feature extraction: Identify patterns and meaning in the text
• Classification layer: Map representations to predictions
• Predicted labels: Output results like sentiment, topic, or category
___________
3. Mixture of Experts (MoE)
MoE models improve efficiency in large AI models by activating only a few specialized networks for each token.
• Tokenization: Break input text into tokens
• Gating system: A router decides which experts should process tokens
• Routing: Send tokens to the most relevant expert networks
• Choose experts: Activate only a small subset of experts
• Expert computation: Each expert processes the assigned tokens
• Merge results: Combine outputs from multiple experts
• Weighted sum: Weight each expert's output by its importance score
• Combined output: Merge expert responses into a unified representation
• Forward layer: Further refine the combined result
• Final output: Produce an efficient and accurate prediction
Repost for people in your network so they can understand this.
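To make two of the ideas above concrete, here is a minimal sketch in plain Python. It shows how the attention mask differs between decoder-only (causal) and encoder-only (every token sees every token) models, and how an MoE router might pick a few experts and mix their outputs. The function names, the toy experts, and the choice to renormalize the top-k gate scores are illustrative assumptions, not the internals of any particular model.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(scores, causal):
    """Turn a square matrix of raw attention scores into row-wise weights.

    causal=True  -> decoder-style: token i only sees tokens 0..i
    causal=False -> encoder-style: token i sees every token
    """
    n = len(scores)
    weights = []
    for i in range(n):
        visible = i + 1 if causal else n
        row = softmax(scores[i][:visible])
        weights.append(row + [0.0] * (n - visible))  # hidden tokens get weight 0
    return weights

def moe_layer(token, router_logits, experts, top_k=2):
    """Route one token vector to its top_k experts and mix their outputs."""
    gate = softmax(router_logits)
    # Keep only the top_k highest-scoring experts (ties keep index order).
    chosen = sorted(range(len(experts)), key=lambda e: gate[e], reverse=True)[:top_k]
    norm = sum(gate[e] for e in chosen)  # assumption: renormalize over chosen experts
    out = [0.0] * len(token)
    for e in chosen:
        expert_out = experts[e](token)   # only the chosen experts run
        w = gate[e] / norm
        out = [o + w * y for o, y in zip(out, expert_out)]
    return out, chosen

# Demo with made-up numbers (not from any real model).
uniform_scores = [[0.0, 0.0, 0.0] for _ in range(3)]
decoder_weights = attention_weights(uniform_scores, causal=True)
encoder_weights = attention_weights(uniform_scores, causal=False)

# Four toy "experts" that just scale the input by 1, 2, 3, or 4.
experts = [lambda x, s=s: [s * v for v in x] for s in (1.0, 2.0, 3.0, 4.0)]
mixed, chosen = moe_layer([1.0, 1.0], [0.0, 2.0, 0.0, 2.0], experts, top_k=2)
print(decoder_weights[1], encoder_weights[1], chosen, mixed)
```

In the demo, the second token in the decoder-style case spreads its attention over the first two tokens only, while in the encoder-style case it spreads it over all three. The MoE call runs just two of the four toy experts, which is where the efficiency gain comes from.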
Crypto News & Analys... • 1y
Goatseus Maximus is a cryptocurrency token on the Solana blockchain, known as $GOAT. It combines fun internet memes with financial goals to enable fast and secure transactions. Tech expert Andy Ayrey created it to give users a unique and rewardin
Founder | Agentic AI... • 2m
People think these 3 AI terms are the same, but they're not. I've explained the differences for each. 1. Generative AI: Using AI to create content across text, images, audio, and video. • Encoding-Decoding + Latent Space:
Founder | Agentic AI... • 4m
Most people don't even know these basics of LLMs. I've explained them in a simple way below. 1. Data Collection: LLMs are trained on massive amounts of text from books, websites, articles, and documents so they can learn how language is
Founder | Agentic AI... • 4m
Most people don't know how Gen AI really works. I've explained the core models in a simple way below. 1. Diffusion Models: They learn by adding noise to data and then learning how to remove that noise step by step.
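The add-noise, remove-noise idea can be sketched in a few lines of plain Python. This assumes a DDPM-style formulation with a linear noise schedule, which the post itself does not specify; the schedule values and function names are illustrative. A real diffusion model trains a neural network to predict the noise; here the true noise stands in for that prediction so the round trip is easy to check.

```python
import math
import random

def make_alpha_bars(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear noise schedule (assumed values)."""
    alpha_bars, running = [], 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / max(num_steps - 1, 1)
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars

def add_noise(x0, t, alpha_bars, rng):
    """Forward step: x_t = sqrt(a_bar_t) * x0 + sqrt(1 - a_bar_t) * eps."""
    a_bar = alpha_bars[t]
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    xt = [math.sqrt(a_bar) * x + math.sqrt(1.0 - a_bar) * e for x, e in zip(x0, eps)]
    return xt, eps

def recover_x0(xt, eps_pred, t, alpha_bars):
    """Invert the forward formula, given an estimate of the noise that was added."""
    a_bar = alpha_bars[t]
    return [(x - math.sqrt(1.0 - a_bar) * e) / math.sqrt(a_bar) for x, e in zip(xt, eps_pred)]

# Demo on a tiny made-up "data point".
rng = random.Random(0)
alpha_bars = make_alpha_bars(1000)
x0 = [1.0, -0.5, 0.25]
xt, eps = add_noise(x0, 500, alpha_bars, rng)

# Stand-in for the trained model: pretend it predicted the noise perfectly.
x0_hat = recover_x0(xt, eps, 500, alpha_bars)
print(xt, x0_hat)
```

The further along the schedule `t` is, the smaller `alpha_bars[t]` gets and the more the sample looks like pure noise. Training is about making the network's noise guess good enough that the reverse direction works without knowing `eps`.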
Medial • 1m
Medial Bulletin. Geopolitics: The "Legal Lock" on Trade. Supreme Court Shockwave: The highly anticipated India-US Trade Deal (the 18% tariff agreement) has hit a massive legal snag. The US Supreme Court just struck down the International Emerge