OpenAI asks contractors to upload past work to assess the performance of AI agents

By Gavin Wallace | 10/01/2026 | 4 Mins Read

OpenAI is asking third-party contractors to upload actual assignments and tasks they have performed at their present or former workplaces so the company can evaluate the performance of its next-generation AI models, according to records from OpenAI and the training data company Handshake AI obtained by WIRED.

The project appears to be part of OpenAI’s effort to establish a baseline of human performance on various tasks, which can then be compared with the performance of AI models. In September, OpenAI launched a new evaluation process that measures the performance of AI models against professionals in a wide range of industries. OpenAI says this is an important indicator of its progress toward AGI, or AI systems that outperform humans at most economically valuable tasks.

“We’ve hired folks across occupations to help collect real-world tasks modeled off those you’ve done in your full-time jobs, so we can measure how well AI models perform on those tasks,” one of OpenAI’s confidential documents reads. “Take existing pieces of long-term or complex work (hours or days+) that you’ve done in your occupation and turn each into a task.”

An OpenAI presentation about the project, viewed by WIRED, asks contractors to describe tasks they have done in current or past jobs and to upload examples of real work. Each example should be “a concrete output (not a summary of the file, but the actual file), e.g., Word doc, PDF, Powerpoint, Excel, image, repo,” the presentation notes. OpenAI says contractors can also share fabricated work examples created to show how they would realistically respond in certain scenarios.

OpenAI and Handshake AI declined to comment.

According to OpenAI’s presentation, real-world tasks have two components: the task request (what someone’s boss or coworker told them to accomplish) and the task deliverable (the work that was produced to meet the request). The company’s instructions stress that contractors’ examples should reflect “real, on-the-job work” that the person has “actually done.”

One example in the OpenAI presentation outlines a task from a “Senior Lifestyle Manager at a luxury concierge company for ultra-high-net-worth individuals.” The goal is to “prepare a short, 2-page PDF draft of a 7-day yacht trip overview to the Bahamas for a family who will be traveling there for the first time.” The example includes additional details about the family’s interests and what the itinerary should look like. The “experienced human deliverable” then shows what the contractor would upload in this case: a Bahamas itinerary created for a client.

OpenAI instructs contractors to remove corporate intellectual property and personally identifiable information from the work files they upload. Under a section labeled “Important reminders,” it tells them to “remove or anonymize any: personal information, proprietary or confidential data, material nonpublic information (e.g., internal strategy, unreleased product details).”

One of the documents viewed by WIRED mentions a ChatGPT tool called “Superstar Scrubbing” that provides instructions on how to remove confidential data.

Evan Brown, an intellectual property lawyer with Neal & McDevitt, tells WIRED that AI labs that receive confidential information from contractors at this scale could be subject to trade secret misappropriation claims. Contractors who offer documents to an AI company, even after scrubbing them, risk breaching nondisclosure or trade secret obligations to their prior employers.

“The AI lab is putting a lot of trust in its contractors to decide what is and isn’t confidential,” Brown says. “If they do let something slip through, are the AI labs really taking the time to determine what is and isn’t a trade secret? It seems to me that the AI lab is putting itself at great risk.”
