
OpenAI Requests Additional Data from the Public to Enhance Its AI Training Models

OpenAI, having already trained its AI models using vast internet data, is now in search of domain-specific data – and it could be sourced from you.

Having already trained its AI models on data scraped from across the internet, OpenAI is now seeking domain-specific data to further sharpen these systems’ knowledge, and it is asking the public for help.

The maker of ChatGPT said it will work with organizations to produce public and private datasets under a new program, OpenAI Data Partnerships, to train models like GPT-4 and the new GPT-4 Turbo.

OpenAI is interested in helping curate large-scale datasets that “reflect human society and that are not already easily accessible online to the public today.”

It said it can work with “any” modality or form of content including text, images, audio and video. The Microsoft-backed startup said it would like data that “expresses human intention” – like long-form writing or conversations rather than disconnected snippets.

OpenAI said it is already working with a few parties, including the Icelandic Government and Miðeind ehf., to improve GPT-4’s ability to speak Icelandic using a specially curated dataset.

OpenAI has also partnered with the non-profit Free Law Project, which aims to democratize access to legal understanding by including its large collection of legal documents in AI training.

“Data Partnerships are intended to enable more organizations to help steer the future of AI and benefit from models that are more useful to them, by including content they care about,” a company blog post reads.

No personal data, please

However, OpenAI does not want to work on datasets with sensitive or personal information or information that belongs to a third party.

Instead, OpenAI wants to build an open source dataset for training models which anyone can use. The company is also interested in preparing private datasets for training proprietary AI models.

Beyond datasets, OpenAI CEO Sam Altman said on Monday at the startup’s first developer conference, DevDay, that it would work with corporate clients to make custom AI models.

However, Altman warned that OpenAI “won’t be able to do this with many companies to start.”

“It’ll take a lot of work and in the interest of expectations, at least initially it won’t be cheap. But if you’re excited to push things as far as they can currently go, … we think we can do something pretty great.”

Altman later said the response to DevDay’s announcement of new models and updates is “far outpacing our expectations” and warned of “service instability” on its servers due to demand.

At around the same time, OpenAI confirmed that ChatGPT was the target of a DDoS attack by hackers. It was resolved in two days.

Source: AI Business

