Connect with us

News

OpenAI Establishes Five-Level System To Gauge AI Progress

The ChatGPT creator revealed the new classification system to employees during a recent company-wide meeting.

Published

on

openai establishes five-level system to gauge ai progress

OpenAI has introduced a five-tier framework to monitor its advancement toward developing artificial intelligence that can rival and even surpass human capabilities.

The initiative is the latest in the startup’s efforts to enhance public understanding of AI safety and was shared with staff during a company-wide meeting on Tuesday, July 9. OpenAI intends to present the levels to investors and other stakeholders, which span from conversational AI (Level 1) to AI that can independently operate an entire organization (Level 5).

During the meeting, OpenAI executives informed employees that the company is currently at the first level but is nearing the second level, known as Reasoners. This tier represents AI systems capable of basic problem-solving tasks comparable to a human with a doctorate-level education.

In the same session, OpenAI’s leadership demonstrated a research project featuring the GPT-4 AI model, showcasing new skills indicative of human-like reasoning. For years, the company has been working towards achieving what is often referred to as artificial general intelligence (AGI), which entails creating computers that can outperform humans in most tasks. Such systems do not yet exist, though OpenAI CEO Sam Altman has previously suggested that AGI might be achievable later this decade.

Also Read: The Most AI-Proof Career Opportunities In The Middle East

Determining the criteria for AGI has been a topic of ongoing debate among AI researchers. In a paper published in November 2023, researchers at Google DeepMind proposed a framework of five ascending AI levels, including “expert” and “superhuman”, which resembles the classification system used in the automotive industry for self-driving cars.

According to OpenAI’s proposed levels, the third tier on the road to AGI is called Agents, representing AI systems that can perform tasks autonomously over several days. Level 4 describes AI that can generate new innovations, while the highest level, Organizations, refers to AI capable of managing entire enterprises.

The framework, developed by OpenAI executives and senior leaders, is considered a work in progress. The company plans to collect feedback from employees, investors, and its board, with the possibility of refining the levels over time.

Advertisement

📢 Get Exclusive Monthly Articles, Updates & Tech Tips Right In Your Inbox!

JOIN 21K+ SUBSCRIBERS

Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

News

UAE-Built Falcon-H1 Arabic Leads LLM Benchmarks

The lean Emirati-built language model beats larger global systems and puts Arabic at the center of training.

Published

on

uae-built falcon-h1 arabic leads llm benchmarks
Abu Dhabi Technology Innovation Institute

Abu Dhabi’s Technology Innovation Institute has released an Arabic-first large language model that tops global test boards, an uncommon edge for a region long served by English-centric systems.

Falcon-H1 Arabic comes in 3B, 7B and 34B versions. The flagship posts 75.36% accuracy on comprehensive Arabic tasks and ranks first on the Open Arabic LLM Leaderboard. It also outperforms Meta’s Llama-70B and Alibaba’s Qwen-72B while using less than half their parameters. The smallest model beats Microsoft’s Phi-4 Mini by ten percentage points on equivalent benchmarks.

Arabic remains hard territory for AI. Flexible word order, dense morphology and constant switching between regional dialects and Modern Standard Arabic leave many global models missing context or tone. Academic research has pointed to a shortage of annotated datasets for dialect and informal speech. The impact shows up in classrooms, call centers and government portals where Arabic chatbots lag their English counterparts.

TII trained Falcon-H1 Arabic on formal writing, dialects and culturally grounded content. Beyond scores, it handles practical use: long conversations, reasoning rather than literal translation, and inputs of up to 192,000 words — enough for medical records or legal filings.

“The aim is innovation that is accessible, relevant, and impactful,” said Faisal Al Bannai, Adviser to the UAE President and Secretary-General of the Advanced Technology Research Council.

Also Read: Governata Raises $4M For Saudi AI Data-Governance Push

Arabic is spoken by more than 450 million people across over 20 countries, yet has often been treated as a secondary language for foundation models. The UAE move signals a push to flip that logic and build Arabic-native stacks rather than wait for global systems to improve.

Falcon models have led their categories since 2023. With H1 Arabic, TII is offering free access via chat.falconllm.tii.ae for developers, media, healthcare and public-sector users looking to automate in natural Arabic.

As the region continues to invest in sovereign computing and data localization, the addition of Falcon-H1 Arabic adds a powerful tool built for the native language, instead of an afterthought attached to an English-trained system.

Continue Reading

#Trending