Founder | Agentic AI...ย โขย 12h
3 transformer architectures everyone should know. I've explained it in a simple way below. 1. ๐๐ฒ๐ฐ๐ผ๐ฑ๐ฒ๐ฟ-๐ข๐ป๐น๐ ๐ ๐ผ๐ฑ๐ฒ๐น๐ These are mainly used for ๐๐ฒ๐ ๐ ๐ด๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐ผ๐ป (like ChatGPT). They predict the ๐ป๐ฒ๐ ๐ ๐๐ผ๐ธ๐ฒ๐ป ๐๐๐ฒ๐ฝ ๐ฏ๐ ๐๐๐ฒ๐ฝ. โข ๐ง๐ผ๐ธ๐ฒ๐ป๐ถ๐๐ฎ๐๐ถ๐ผ๐ป: Break input text into smaller tokens โข ๐ฃ๐ผ๐๐ถ๐๐ถ๐ผ๐ป๐ฎ๐น ๐ฒ๐ป๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด: Add position info so the model understands word order โข ๐ฆ๐ฒ๐น๐ณ-๐ฎ๐๐๐ฒ๐ป๐๐ถ๐ผ๐ป: Each token looks at previous tokens for context โข ๐ค๐๐ฒ๐ฟ๐/๐๐ฒ๐/๐ฉ๐ฎ๐น๐๐ฒ: Mechanism used to measure token relationships โข ๐๐ฎ๐๐๐ฎ๐น ๐ฎ๐๐๐ฒ๐ป๐๐ถ๐ผ๐ป: Tokens can only see earlier tokens, not future ones โข ๐๐ฒ๐ฒ๐ฑ๐ณ๐ผ๐ฟ๐๐ฎ๐ฟ๐ฑ ๐น๐ฎ๐๐ฒ๐ฟ: Neural layer refines token representations โข ๐๐ผ๐ป๐๐ฒ๐ ๐ ๐๐ฝ๐ฑ๐ฎ๐๐ฒ: Model refreshes its internal understanding โข ๐ก๐ฒ๐ ๐ ๐๐ผ๐ธ๐ฒ๐ป ๐ฝ๐ฟ๐ฒ๐ฑ๐ถ๐ฐ๐๐ถ๐ผ๐ป: Predict the most probable next word โข ๐๐๐๐ผ๐ฟ๐ฒ๐ด๐ฟ๐ฒ๐๐๐ถ๐๐ฒ ๐ด๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐ผ๐ป: Repeat prediction until the full response is formed โข ๐ข๐๐๐ฝ๐๐ ๐๐ฒ๐พ๐๐ฒ๐ป๐ฐ๐ฒ: Generated text becomes the final result ___________ 2. ๐๐ป๐ฐ๐ผ๐ฑ๐ฒ๐ฟ-๐ข๐ป๐น๐ ๐ ๐ผ๐ฑ๐ฒ๐น๐ They focus on ๐บ๐ผ๐ฑ๐ฒ๐น๐ ๐๐ต๐ฎ๐ ๐๐ป๐ฑ๐ฒ๐ฟ๐๐๐ฎ๐ป๐ฑ ๐๐ฒ๐ ๐ rather than generating it, like classification, embeddings, and search tasks. โข ๐ง๐ผ๐ธ๐ฒ๐ป๐ถ๐๐ฎ๐๐ถ๐ผ๐ป: Convert text into tokens โข ๐๐บ๐ฏ๐ฒ๐ฑ๐ฑ๐ถ๐ป๐ด ๐น๐ฎ๐๐ฒ๐ฟ: Transform tokens into numerical vectors โข ๐ฃ๐ผ๐๐ถ๐๐ถ๐ผ๐ป๐ฎ๐น ๐ฒ๐ป๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด: Add sequence order information โข ๐ฆ๐ฒ๐น๐ณ-๐ฎ๐๐๐ฒ๐ป๐๐ถ๐ผ๐ป: Each token attends to every other token โข ๐ ๐๐น๐๐ถ-๐ต๐ฒ๐ฎ๐ฑ ๐ฎ๐๐๐ฒ๐ป๐๐ถ๐ผ๐ป: Capture multiple relationships simultaneously โข ๐๐ฎ๐๐ฒ๐ฟ ๐ป๐ผ๐ฟ๐บ๐ฎ๐น๐ถ๐๐ฎ๐๐ถ๐ผ๐ป: Stabilize values during processing โข ๐๐ผ๐ป๐๐ฒ๐ ๐ ๐ฒ๐ป๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด: Build a deep contextual representation โข ๐๐ฒ๐ฎ๐๐๐ฟ๐ฒ ๐ฒ๐ ๐๐ฟ๐ฎ๐ฐ๐๐ถ๐ผ๐ป: Identify patterns and meaning in the text โข ๐๐น๐ฎ๐๐๐ถ๐ณ๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป ๐น๐ฎ๐๐ฒ๐ฟ: Map representations to predictions โข ๐ฃ๐ฟ๐ฒ๐ฑ๐ถ๐ฐ๐๐ฒ๐ฑ ๐น๐ฎ๐ฏ๐ฒ๐น๐: Output results like sentiment, topic, or category ___________ 3. ๐ ๐ถ๐ ๐๐๐ฟ๐ฒ ๐ผ๐ณ ๐๐ ๐ฝ๐ฒ๐ฟ๐๐ (๐ ๐ผ๐) MoE models improve ๐ฒ๐ณ๐ณ๐ถ๐ฐ๐ถ๐ฒ๐ป๐ฐ๐ ๐ถ๐ป ๐น๐ฎ๐ฟ๐ด๐ฒ ๐๐ ๐บ๐ผ๐ฑ๐ฒ๐น๐ by activating only a few specialized networks. โข ๐ง๐ผ๐ธ๐ฒ๐ป๐ถ๐๐ฎ๐๐ถ๐ผ๐ป: Break input text into tokens โข ๐๐ฎ๐๐ถ๐ป๐ด ๐๐๐๐๐ฒ๐บ: A router decides which experts should process tokens โข ๐ฅ๐ผ๐๐๐ถ๐ป๐ด: Send tokens to the most relevant expert networks โข ๐๐ต๐ผ๐ผ๐๐ฒ ๐ฒ๐ ๐ฝ๐ฒ๐ฟ๐๐: Activate only a small subset of experts โข ๐๐ ๐ฝ๐ฒ๐ฟ๐ ๐ฐ๐ผ๐บ๐ฝ๐๐๐ฎ๐๐ถ๐ผ๐ป: Each expert processes the assigned tokens โข ๐ ๐ฒ๐ฟ๐ด๐ฒ ๐ฟ๐ฒ๐๐๐น๐๐: Combine outputs from multiple experts โข ๐ช๐ฒ๐ถ๐ด๐ต๐๐ฒ๐ฑ ๐๐๐บ: Assign importance scores to expert outputs โข ๐๐ผ๐บ๐ฏ๐ถ๐ป๐ฒ๐ฑ ๐ผ๐๐๐ฝ๐๐: Merge expert responses into a unified representation โข ๐๐ผ๐ฟ๐๐ฎ๐ฟ๐ฑ ๐น๐ฎ๐๐ฒ๐ฟ: Further refine the combined result โข ๐๐ถ๐ป๐ฎ๐น ๐ผ๐๐๐ฝ๐๐: Produce an efficient and accurate prediction โ Repost for people in your network so they can understand this.

Crypto News & Analys...ย โขย 1y
Goatseus Maximus is a cryptocurrency token on the Solana blockchain, known as the $GOAT. It combines fun internet memes with financial goals to enable fast and secure transactions. Tech expert Andy Ayrey created it to give users a unique and rewardin
See MoreFounder | Agentic AI...ย โขย 1d
People think these 3 AI terms are same, they're not. Iโve explained differences for each. 1. ๐๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐๐ฒ ๐๐ Using AI to create content across text, images, audio, and video. โข ๐๐ป๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด-๐๐ฒ๐ฐ๐ผ๐ฑ๐ถ๐ป๐ด + ๐๐ฎ๐๐ฒ๐ป๐ ๐ฆ๐ฝ๐ฎ๐ฐ๐ฒ:
See More
Founder | Agentic AI...ย โขย 2m
Most people don't even know these basics of LLM's. I've explained it in a simple way below. 1. ๐๐ฎ๐๐ฎ ๐๐ผ๐น๐น๐ฒ๐ฐ๐๐ถ๐ผ๐ป LLMs are trained on massive amounts of text from books, websites, articles, and documents so they can learn how language is
See More
Founder | Agentic AI...ย โขย 1m
Most people donโt know how Gen AI really works. Iโve explained core models in simple way below. 1. ๐๐ถ๐ณ๐ณ๐๐๐ถ๐ผ๐ป ๐ ๐ผ๐ฑ๐ฒ๐น๐ They learn by ๐ฎ๐ฑ๐ฑ๐ถ๐ป๐ด ๐ป๐ผ๐ถ๐๐ฒ to data and then learning how to ๐ฟ๐ฒ๐บ๐ผ๐๐ฒ ๐๐ต๐ฎ๐ ๐ป๐ผ๐ถ๐๐ฒ step by step.
See More
Download the medial app to read full posts, comements and news.