Here's Proof You Can Train an AI Model Without Slurping Copyrighted Content

Wired · 2y ago

French researchers have released a large AI training dataset composed entirely of public domain text, challenging the belief that copyrighted materials are necessary for training AI models. Meanwhile, the non-profit organization Fairly Trained has awarded its first certification for a large language model built without copyright infringement. Developed by 273 Ventures, the KL3M model was trained on a curated dataset of legal, financial, and regulatory documents. Fairly Trained certifies companies that train their AI models on data that they own, that they have licensed, or that is in the public domain. The availability of infringement-free datasets like these could reduce the AI industry's reliance on copyrighted materials.

Related News

Indian news agency sues OpenAI alleging copyright infringement

TechCrunch · 1y ago

Asian News International (ANI) has filed a lawsuit against OpenAI in the Delhi High Court, accusing the AI company of using its copyrighted news content without permission. ANI alleges that OpenAI used its content to train its AI models and generated false information attributed to the news agency. This is the first time an Indian media organization has taken legal action against OpenAI over copyright claims. OpenAI confirmed it has ensured that its ChatGPT model no longer accesses ANI's website. The court plans to appoint an independent expert to advise on the copyright implications of AI models using publicly available content.

OpenAI denies infringement allegations in author copyright cases

Economic Times · 1y ago

OpenAI has filed a response to copyright infringement allegations in court, stating that it makes fair use of copyrighted content to train its AI language model. The company argues that training AI models requires building on previous ideas and that fair use protection enables the development of new concepts. This comes in response to lawsuits filed by authors, including Michael Chabon and Ta-Nehisi Coates, against OpenAI and Meta Platforms last year. The lawsuits raise questions about whether using scraped material from the internet to train AI infringes copyrighted material on a large scale.

Microsoft sued by authors over use of books in AI training

Economic Times · 9m ago

A group of authors is suing Microsoft, alleging the company used about 200,000 pirated books without permission to train its AI model, Megatron. Filed in a New York federal court, the lawsuit seeks damages and an injunction against further use. This case joins several legal challenges concerning the unauthorized use of copyrighted material in AI training, highlighting ongoing debates around copyright infringement and fair use in the AI industry.

ANI vs OpenAI: Delhi HC to hear India’s first GenAI copyright suit on 21 Feb

Livemint · 1y ago

The Delhi High Court will start hearings on 21 February 2025 on ANI Media's lawsuit against OpenAI, which alleges that OpenAI used ANI's copyrighted content to train its language models without permission. The case could significantly affect AI regulation and copyright law in India. ANI seeks ₹2 crore in damages and an injunction against OpenAI. The outcome might set precedents for AI-related copyright disputes and influence India's future AI legislation and international governance.

Apple sued over use of copyrighted books to train Apple Intelligence

Economic Times · 5m ago

Apple is facing a lawsuit in California for allegedly using thousands of pirated books to train its AI model, Apple Intelligence. Neuroscientists Susana Martinez-Conde and Stephen Macknik claim Apple used copyrighted material, including their works, taken from illegal "shadow libraries." The suit alleges that Apple Intelligence, which is integrated into iOS devices, was trained on these pirated books, and it seeks damages and an order stopping the misuse. The case is among several lawsuits against tech companies over unauthorized use of copyrighted content in AI training.

News Portals Flag Use Of Content By AI Giants Without Consent

Inc42 · 9m ago

News publishers have expressed concerns over tech giants using their copyrighted data to train AI models without consent, which they view as copyright infringement. Tech companies claim they need large datasets, while publishers seek a fair compensation regime for content creators. Stakeholders are debating whether regulations should allow text and data mining while giving content creators the choice to opt out. The issue was highlighted in a DPIIT-organized consultation amid ongoing discussions between publishers and AI firms.

OpenAI says it’s “impossible” to create useful AI models without copyrighted material

Arstechnica · 2y ago

ChatGPT developer OpenAI has acknowledged that the development of AI tools like ChatGPT would be "impossible" without using copyrighted material. OpenAI made this statement in response to an inquiry by the UK's House of Lords about the use of copyrighted content in AI training. The practice of scraping copyrighted content for training AI models has come under scrutiny, particularly with the recent commercialization of deep learning AI models. OpenAI asserts that limiting training data to public domain content would not yield AI systems that meet the needs of today's citizens, and it claims fair use in its defense against copyright lawsuits.

OpenAI takes on TikTok, YouTube Shorts with Sora 2 video app; copyright a potential issue

Economic Times · 6m ago

OpenAI has launched Sora 2, an advanced video generation AI model, along with the Sora app to compete with platforms like TikTok and YouTube Shorts. Sora 2 can create high-definition, sound-synced clips and enables users to insert themselves into videos. The app emphasizes content from users' networks and includes a Cameo feature that requires identity verification for personalized deepfake content. Copyright issues arise because the model uses copyrighted content unless rights holders opt out.

Meta Accused of Torrenting Porn to Advance Its Goal of AI ‘Superintelligence’

Wired · 6m ago

Strike 3 Holdings is suing Meta, accusing it of pirating adult videos to train AI models in pursuit of AI 'superintelligence'. The lawsuit alleges that Meta used BitTorrent to download and distribute copyrighted videos in violation of copyright law, and that doing so made them accessible to minors. Strike 3 claims Meta used this tactic to gain a competitive advantage in AI development. Meta denies the allegations. The case highlights ongoing legal challenges over AI training on copyrighted content.

Jensen Huang-led Nvidia faces copyright infringement lawsuit over AI training

IndianStartupNews · 2y ago

Authors Brian Keene, Abdi Nazemian, and Stewart O'Nan have filed a lawsuit against Nvidia, alleging that the tech company used their copyrighted works without permission to train its NeMo AI platform. The training dataset, consisting of nearly 200,000 books, was removed in October after copyright infringement claims. NeMo is Nvidia's platform for generative AI technologies, which create new content from various inputs. The authors are seeking unspecified damages on behalf of all US authors whose works were used to train NeMo over the past three years. The case highlights the ongoing issue of copyright infringement in AI development.
