Post on Medial

Shuvodip Ray

 • 

YouTube • 6m

Researchers at Google DeepMind introduced Semantica, an image-conditioned diffusion model that generates images based on the semantics of a conditioning image. The paper explores how to adapt image generative models to different datasets. Finetuning a separate model for each dataset is impractical for large-scale models, so Semantica relies on in-context learning instead. It is trained on web-scale image pairs: one random image from a webpage is used to condition the generation of another image from the same page, on the assumption that images on the same page share semantic traits.
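The pairing idea described in the post can be illustrated with a small sketch. This is not the paper's data pipeline, only a toy version under stated assumptions: the `pages` dictionary, its URLs and file names, and the `sample_pair` helper are all hypothetical, and real training would load image tensors rather than file names.

```python
import random


def sample_pair(pages, rng):
    """Pick one webpage, then two distinct images from it.

    Returns (url, conditioning_image, target_image).
    Hypothetical helper for illustration only.
    """
    # Only pages with at least two images can yield a pair.
    eligible = [url for url, images in pages.items() if len(images) >= 2]
    url = rng.choice(eligible)
    # random.sample draws without replacement, so the two images differ.
    conditioning, target = rng.sample(pages[url], 2)
    return url, conditioning, target


# Hypothetical toy data: page URL -> image file names found on that page.
pages = {
    "example.com/birds": ["sparrow.jpg", "finch.jpg", "robin.jpg"],
    "example.com/cars": ["sedan.jpg", "coupe.jpg"],
    "example.com/solo": ["lonely.jpg"],  # skipped: only one image
}

rng = random.Random(0)
url, conditioning, target = sample_pair(pages, rng)
print(url, conditioning, target)
```

In this sketch the conditioning image and the target always come from the same page, which is the only source of "semantic similarity" the training signal assumes.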

2 replies • 3 likes

More like this

Recommendations from Medial


Aryan patil

 • 

Monkey Ads • 7m

Train your AI model for free here 👇 No technical skills required to do this. You can train image, audio, and pose models here...

3 replies • 9 likes

Three Commas Gang

Stealth • 6m

AI solution for marketers and product sellers! Building a webapp where you just upload a product image and input a prompt, and voila!!! You can get studio-quality images instantly in any orientation and style possible. Will this work better in subsc

3 replies • 1 like

Hawk

 • 

Medial • 3m

Midjourney 6.1 is here! What’s new in V6.1?
- More coherent images (arms, legs, hands, bodies, plants, animals, etc)
- Much better image quality (reduced pixel artifacts, enhanced textures, skin, 8bit retro, etc)
- More precise, detailed, and correc

8 replies • 18 likes

Uttkarsh Singh

Stealth • 4m

📌 More Harvard dropouts are killing it. This time it's Gavin Uberti and Chris Zhu, who say they have created the fastest AI chip in the world, beating Nvidia. Their startup is named Etched and their chip is named Sohu. Etched says Sohu can achieve ov

2 replies • 9 likes
Anonymous

Huge announcement from Meta. Welcome Llama 3.1🔥 This is all you need to know about it: The new models: - The Meta Llama 3.1 family of multilingual large language models (LLMs) is a collection of pre-trained and instruction-tuned generative models

1 reply • 4 likes

Baqer Ali

Stealth • 6m

OpenAI has released a bomb in the tech industry. The new GPT-4o model is amazing. Here 'o' stands for omni, which means everything. It can process text, audio, and also visuals (images and videos). The demo for this new model is jaw-dropping and this

0 replies • 4 likes

Aura

Stealth • 4m

New and Improved AI Models: Inflection-2: Inflection AI introduced Inflection-2, a large language model excelling at reasoning and following instructions. This could pave the way for AI assistants that can understand and complete complex tasks. Claude

1 reply • 2 likes

Saksham

 • 

Bebyond • 3m

The Future of Licensing Deals: Is Subscription the New Frontier? As technology reshapes industries, the traditional licensing model is ripe for disruption. Could subscription-based licensing be the next big thing? Let's explore this trend and its im

0 replies • 5 likes

Jeet Sarkar

Stealth • 7m

Microsoft recently revealed VASA-1, an impressive generative AI model that can turn a single still photo into a believable video. That's fuckin scary. Here is a simple explanation of how it works: Essentially, VASA-1 examines a still image and us

1 reply • 4 likes

AjayEdupuganti

Stealth • 8m

Microsoft (OpenAI) + Nvidia vs Google + Apple. Edge AI refers to computing on your device, as opposed to the cloud. The first company to successfully implement this could capture a significant market share. The key players are expected to be Micro

2 replies • 7 likes
