Dedicated AI infrastructure helps ensure data is controlled, compliant and secure, while models remain accurate and differentiated. However, this reassurance comes at a cost that may not be justified compared with cheaper options.
Dr. Owen Rogers is Uptime Institute’s Senior Research Director of Cloud Computing. Dr. Rogers has been analyzing the economics of cloud for over a decade as a chartered engineer, product manager and industry analyst. Rogers covers all areas of cloud, including AI, FinOps, sustainability, hybrid infrastructure and quantum computing.
orogers@uptimeinstitute.com
Dedicated AI infrastructure helps ensure data is controlled, compliant and secure, while models remain accurate and differentiated. However, this reassurance comes at a cost that may not be justified compared with cheaper options.
A new wave of GPU-focused cloud providers is offering high-end hardware at prices lower than those charged by hyperscalers. Dedicated infrastructure needs to be highly utilized to outperform these neoclouds on cost.
Hyperscalers design their own servers and silicon to scale colossal server estates effectively. AWS uses a system called Nitro to offload virtualization, networking and storage management from the server processor onto a custom chip.
Uptime Intelligence surveys the data center industry landscape to look deeper at what can actually happen in 2025 and beyond based on the latest trends and developments. The stronghold that AI has on the industry is a constant discussion - but how…
Dedicated GPU infrastructure can beat the public cloud on cost. Companies considering purchasing an AI cluster need to consider utilization as the key variable in their calculations.
Uptime Intelligence looks beyond the more obvious trends of 2025 and examines some of the latest developments and challenges shaping the data center industry.
Cloud providers need to win AI use cases in their early stages of development. If they fail to attract customers, their AI applications may be locked-in to rival platforms and harder to move, which can have serious repercussions.
Visibility into costs remains a top priority for enterprises that are consuming cloud services. Improving the tagging of workloads and resources may help them to spot, and curb, rising costs.
The cost and complexity of deploying large-scale GPU clusters for generative AI training will drive many enterprises to the public cloud. Most enterprises will use pre-trained foundation models, to reduce computational overheads.
While the aim of FinOps is to manage just the cloud costs, technology business management seeks to aggregate all costs of IT, including data centers, servers, software and labor, to identify savings and manage return on investment.
Enterprises have various options on how and where to deploy their AI training and inference workloads. This report explains how these different options balance cost, complexity and customization.
To meet the demand driven by AI workloads, a new breed of cloud provider has emerged, delivering inexpensive GPU infrastructure as a service. Their services are highly demanded today, but longer-term, the market is ripe for consolidation.
While GPUs are the power-hungry devices that enable effective AI training, it is innovations in software that are fueling the recent surge in interest and investment. This report explains how neural networks power generative AI.
Although quantum computing promises a revolution in scientific discovery, its use is still constrained to research and continuing development. However, a new IBM quantum data center in Germany signals a growing interest in its capabilities.
Reserved instances are a pricing model for virtual machines offered by cloud providers. As they offer savings of up to 70% compared with on-demand pricing, organizations should use them liberally, especially in challenging times.