OpenAI announces Safety Gym for reinforcement learning

OpenAI has announced Safety Gym, a solution for training AI models through reinforcement learning. Reinforcement learning is the training of AI models using punishments or rewards.

A number of companies, including Intel’s Mobileye as well as Nvidia, have proposed a framework to ensure safe and logical decision making by AI models. The U.S. company OpenAI has therefore devised Safety Gym, a collection of tools for the development of AI models that has certain safety restrictions during training. Also, the level of safety of certain algorithms and the extent to which those algorithms avoid errors while learning can be compared, writes VentureBeat.

New approach

OpenAI has devised a new form of learning by punishment and reward. This reinforcement learning implements functions that limit the AI, but at the same time provide it with a greater degree of security. For example, models for self-driving vehicles can be made significantly safer. The OpenAI approach is called ‘constrained reinforcement learning’, and according to OpenAI it is a step towards much safer artificial intelligence.

The company explains the approach as follows in a blog post, with a model for autonomous vehicles as an example: “In normal reinforcement learning, you would pick the collision fine at the beginning of training and keep it fixed forever. The problem here is that if the pay-per-trip is high enough, the agent may not care whether it gets in lots of collisions (as long as it can still complete its trips). [With] constrained reinforcement learning, you would pick the acceptable collision rate at the beginning of training, and adjust the collision fine until the agent is meeting that requirement.”

Stay tuned, subscribe!

OpenAI announces Safety Gym for reinforcement learning

Tags in this article

New approach

Events - Techcalendar

NIS2: Een pragmatische aanpak voor een weerbare Organisatie

Microsoft Discovery Workshop Infrastructure & Database Modernization

Red Hat Summit

Top Stories

Newest ASML machine at Intel is ready to go, with plenty of R&D ahead

How did Phishing-as-a-Service group LabHost operate?

Cisco Hypershield: new security architecture protects agains new (and old) problems

Process HQ steals the show in 24.2 release of Appian Platform

Google Chat must compete with Slack and Teams

Recent news

The ability to offer iOS apps through your own website is now finally here

Google moves part of operations and lays off employees once more

French AI startup Mistral AI again looking for investors

Java highly vulnerable relative to other programming languages

ASML sends High-NA EUV machine to second customer

NetSuite Analytics Warehouse available in 11 new countries

Stay tuned, subscribe!

Tags in this article

New approach

Related articles

Linux Foundation drives open-source AI with creation of OPEA

Deepmind CEO expects to spend more than $100 billion on AI

OpenAI trained GPT-4 on millions of hours of YouTube audio

OpenAI makes fine-tuning of its LLMs easier and cheaper

Caracal release of OpenStack bets on AI workloads and VMware refugees

Events - Techcalendar

NIS2: Een pragmatische aanpak voor een weerbare Organisatie

Microsoft Discovery Workshop Infrastructure & Database Modernization

Red Hat Summit

Top Stories

Newest ASML machine at Intel is ready to go, with plenty of R&D ahead

How did Phishing-as-a-Service group LabHost operate?

Cisco Hypershield: new security architecture protects agains new (and old) problems

Process HQ steals the show in 24.2 release of Appian Platform

Google Chat must compete with Slack and Teams

Recent news

The ability to offer iOS apps through your own website is now finally here

Google moves part of operations and lays off employees once more

French AI startup Mistral AI again looking for investors

Java highly vulnerable relative to other programming languages

ASML sends High-NA EUV machine to second customer

NetSuite Analytics Warehouse available in 11 new countries