The steady increase in artificial intelligence (AI) workloads and data center thirst for more power can be supported by siting AI compute centers with renewable energy systems. An approach proposed by Microsoft Research envisions shifting large-scale AI inference — the process by which machine learning models draw conclusions from new data — to modular data centers co-located with wind farms to bypass overtaxed electric grids and exploit abundant, underutilized green energy.

More than six million high-end graphics processing units (GPUs) could be situated at wind farm sites to tap into low-cost green power at its source. A critical element of this so-called AI Greenferencing scheme is Heron, a new cross-site software router designed to efficiently manage the variable nature of wind power by dynamically routing workloads across distributed compute clusters.

By co-locating compute with energy sources and leveraging Heron to route tasks based on real-time power availability, developers can seize the opportunity to appropriately size and configure GPU deployments in order to meet user needs in regions where AI demand outpaces grid capacity.

The study published in arXiv Computer Science confirms that wind energy input defined by Heron improves aggregate goodput of AI compute — ratio between delivered amount of information and the total delivery time — by up to 80% compared to the state-of-the-art.

To contact the author of this article, email shimmelstein@globalspec.com