Get Over Q*, OpenAI Takes AGI to the Next Level with PPO

The OpenAI drama has ended, and the real action begins. The company is reportedly working in secret on Q* (possibly based on Q-learning), but there is another interesting technique that has long been OpenAI's favourite: PPO, short for proximal policy optimisation.

OpenAI's VP of Product, Peter Welinder, recently posted on X: "Everyone reading up on Q-learning. Just wait until they hear about PPO."

What is PPO?

PPO is a reinforcement learning algorithm used to train artificial intelligence models to make decisions in complex or simulated environments.

Interestingly, PPO became the default reinforcement learning algorithm at OpenAI in 2017 because of its ease of use and good performance. 

The "proximal" in PPO's name refers to the constraint applied to policy updates: each new policy is kept close to the previous one, which prevents excessively large changes and contributes to more stable and reliable learning.
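To make that constraint concrete, below is a minimal sketch of PPO's clipped surrogate objective in Python (using PyTorch); the function and argument names are illustrative and not taken from any OpenAI codebase.

```python
import torch

def ppo_clipped_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Clipped surrogate objective from the PPO paper (Schulman et al., 2017).

    The probability ratio between the new and old policy is clipped to
    [1 - clip_eps, 1 + clip_eps]; this is the "proximal" constraint that
    keeps each update close to the previous policy.
    """
    ratio = torch.exp(new_log_probs - old_log_probs)  # pi_new(a|s) / pi_old(a|s)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Pessimistic bound: take the element-wise minimum, negate for gradient descent.
    return -torch.min(unclipped, clipped).mean()
```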

OpenAI employs PPO due to its effectiveness in optimising policies for sequential decision-making tasks. 

Moreover, PPO strikes a balance between exploration and exploitation, which is crucial in reinforcement learning, by updating the policy incrementally while keeping each change constrained.

OpenAI has applied PPO across a variety of use cases, from training agents in simulated environments to mastering complex games.

PPO’s versatility allows it to excel in scenarios where an agent must learn a sequence of actions to achieve a specific goal, making it valuable in fields such as robotics, autonomous systems, and algorithmic trading. 
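As an illustration of how PPO is typically used to train an agent in a simulated environment, here is a minimal sketch using the open-source Stable-Baselines3 implementation of PPO on a standard Gymnasium control task; the library and environment are assumptions chosen for the example, not tools the article attributes to OpenAI.

```python
# Minimal sketch: training and evaluating a PPO agent in a simulated environment.
# Stable-Baselines3 and the CartPole-v1 task are illustrative choices.
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")
model = PPO("MlpPolicy", env, verbose=1)   # clipped-objective PPO with an MLP policy
model.learn(total_timesteps=50_000)        # incremental, constrained policy updates

# Roll out the trained policy for one episode.
obs, _ = env.reset()
done = False
while not done:
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    done = terminated or truncated
env.close()
```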

Chances are that OpenAI is aiming to achieve AGI through gaming and simulated environments with the help of PPO.
Interestingly, earlier this year OpenAI acquired Global Illumination to train agents in simulated environments.

The post Get Over Q*, OpenAI takes AGI to the Next Level with PPO  appeared first on Analytics India Magazine.
