OpenAI’s Foundry will let customers buy dedicated compute to run its AI models

Share via:

OpenAI is quietly launching a new developer platform that will allow customers to run the company’s more recent machine learning models, such as GPT-3.5, on dedicated capacity. In screenshots of documentation shared on Twitter by early access users, OpenAI describes the upcoming Foundry offering as “designed for cutting-edge customers running larger workloads.”

Foundry will provide service-level commitments such as uptime and on-demand engineering support. Rentals will be based on dedicated compute units with three-month or one-year commitments; each model instance will require a certain number of compute units (see the chart below). Instances will not be inexpensive. Running a lightweight version of GPT-3.5 will cost $78,000 over three months or $264,000 over one year. To put that in context, one of Nvidia’s most recent-generation supercomputers, the DGX Station, costs $149,000 per unit.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Popular

More Like this

OpenAI’s Foundry will let customers buy dedicated compute to run its AI models

OpenAI is quietly launching a new developer platform that will allow customers to run the company’s more recent machine learning models, such as GPT-3.5, on dedicated capacity. In screenshots of documentation shared on Twitter by early access users, OpenAI describes the upcoming Foundry offering as “designed for cutting-edge customers running larger workloads.”

Foundry will provide service-level commitments such as uptime and on-demand engineering support. Rentals will be based on dedicated compute units with three-month or one-year commitments; each model instance will require a certain number of compute units (see the chart below). Instances will not be inexpensive. Running a lightweight version of GPT-3.5 will cost $78,000 over three months or $264,000 over one year. To put that in context, one of Nvidia’s most recent-generation supercomputers, the DGX Station, costs $149,000 per unit.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

More like this

Musk: Polymarket ‘more accurate than polls, as actual money...

The billionaire mogul’s opinion comes as he ramps...

macOS Sequoia 15.1 will prompt you less often for...

With today’s release of macOS Sequoia 15.1 beta...

US judge orders Google to open up Google Play...

A federal judge has ordered Google to open...

Popular

Upcoming Events

Startup Information that matters. Get in your inbox Daily!