OpenAI’s Foundry will let customers buy dedicated compute to run its AI models

Share via:

OpenAI is quietly launching a new developer platform that will allow customers to run the company’s more recent machine learning models, such as GPT-3.5, on dedicated capacity. In screenshots of documentation shared on Twitter by early access users, OpenAI describes the upcoming Foundry offering as “designed for cutting-edge customers running larger workloads.”

Foundry will provide service-level commitments such as uptime and on-demand engineering support. Rentals will be based on dedicated compute units with three-month or one-year commitments; each model instance will require a certain number of compute units (see the chart below). Instances will not be inexpensive. Running a lightweight version of GPT-3.5 will cost $78,000 over three months or $264,000 over one year. To put that in context, one of Nvidia’s most recent-generation supercomputers, the DGX Station, costs $149,000 per unit.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Popular

More Like this

OpenAI’s Foundry will let customers buy dedicated compute to run its AI models

OpenAI is quietly launching a new developer platform that will allow customers to run the company’s more recent machine learning models, such as GPT-3.5, on dedicated capacity. In screenshots of documentation shared on Twitter by early access users, OpenAI describes the upcoming Foundry offering as “designed for cutting-edge customers running larger workloads.”

Foundry will provide service-level commitments such as uptime and on-demand engineering support. Rentals will be based on dedicated compute units with three-month or one-year commitments; each model instance will require a certain number of compute units (see the chart below). Instances will not be inexpensive. Running a lightweight version of GPT-3.5 will cost $78,000 over three months or $264,000 over one year. To put that in context, one of Nvidia’s most recent-generation supercomputers, the DGX Station, costs $149,000 per unit.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

More like this

Former Roomba C.E.O. Reminisces on 2008’s Viral ‘Shark Cat’...

new video loaded: Former Roomba C.E.O. Reminisces on 2008’s...

The New Operational Stack: From IaC to Platforms to...

It’s been 10 years since Infrastructure as Code...

Texas judge blocks state’s App Store age verification law

The decision comes just a few days before...

Popular

Upcoming Events