Google: OpenAI disbands research SRE team

Sep 13, 2024 | Posted by Abdul-Rahman Oladimeji

 OpenAI has disbanded its Site Reliability Engineering team focused on research and training workloads which was formed less than a year ago. Todd Underwood who was hired to lead the team  spent 14 years and nine months at Google, where he created the machine learning SRE group, and co-authored the O'Reilly book Reliable Machine Learning
 

"I have not been successful in my attempt to start an SRE team within the research organization at OpenAI," Underwood said in a LinkedIn post. "OpenAI has eliminated the reliability function in research and redistributed the individual contributors into the remaining engineering teams on the research platform organization. I’m no longer an employee of OpenAI."

He added: "There are a few things I’m really good at. But building a new reliability function inside of this particular frenetic research startup remotely turns out to not have been one of them. I have thoughts about why this didn’t go as well as it could have but they’re not really relevant here."