Artificial Intellige...ย โขย 6m
DeepSeek published a new paper detailing the technical details behind its R1 model that shook up the AI space in January, also revealing that it cost just $294,000 to train.
Finding my self ๐ถโ๏ฟฝ...ย โขย 1y
๐คฏChinese AI startup DeepSeek has surpassed OpenAI's ChatGPT in downloads on the US Apple App Store, achieving this milestone just a week after its launch on January 10, 2025. ๐ The DeepSeek R1 model, which utilizes a hybrid architecture for enhanc
See More
Building plany.space...ย โขย 7m
1 day ago OpenAI released GPT-OSS-120B and GPT-OSS-20B, two massive open-weight models. Hereโs everything you need to know about them ๐ 2/ Key features: โ 256K context window โ Sliding window attention โ MoE architecture โ RoPE variant โ New MXFP4
See MorePython Developer ๐ป ...ย โขย 1y
3B LLM outperforms 405B LLM ๐คฏ Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 ๐คฏ ๐คฏ LLM: llama 3 Datasets: MATH-500 & AIME-2024 This has done on research with compute optimal Test-Time Scaling (TTS). Recently, OpenAI o1 shows that Test-
See More
AI Deep Explorer | f...ย โขย 11m
Want to learn AI the right way in 2025? Donโt just take courses. Donโt just build toy projects. Look at whatโs actually being used in the real world. The most practical way to really learn AI today is to follow the models that are shaping the indus
See MoreAI Deep Explorer | f...ย โขย 11m
LLM Post-Training: A Deep Dive into Reasoning LLMs This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs) focusing on improving reasoning capabilities. While LLMs achieve strong performance
See Moreย โขย
YouTubeย โขย 1y
Researchers at Meta recently presented โAn Introduction to Vision-Language Modelingโ, to help people better understand the mechanics behind mapping vision to language. The paper includes everything from how VLMs work, how to train them, and approache
See More
Founder Snippetz Lab...ย โขย 8m
I didnโt think Iโd enjoy reading 80+ pages on training AI models. But this one? I couldnโt stop. Hugging Face dropped a playbook on how they train massive models across 512 GPUs โ and itโs insanely good. Not just technical stuffโฆ itโs like reading a
See More
Download the medial app to read full posts, comements and news.