News · May 6, 2026 · 2 min read

HP pushes local AI compute as cloud costs hit $37B

HP claims on-premises AI hardware delivers an 18x cost advantage over cloud APIs as enterprise GenAI spend surged to $37 billion in 2025.

By Agentic Daily · Verified Source: AI News

Our Take

HP's cost math assumes five-year hardware lifecycles against cloud API pricing, but the real story is data sovereignty requirements forcing enterprises to reconsider cloud-first AI strategies.

Why it matters

Enterprise AI teams face mounting pressure to justify spiraling cloud inference costs while regulatory constraints make local compute more attractive for proprietary data workloads.

Do this week

IT leaders: audit your current AI spend by workload type this week so you can separate experimental from production costs before budget planning cycles close.
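
A minimal sketch of that audit, assuming a simple billing export; the record fields, workload names, and the prod/dev split used to separate experimental from production spend are illustrative assumptions, not a real billing schema.

```python
from collections import defaultdict

def audit_spend(records):
    """Sum AI spend per workload, split into experimental vs production buckets."""
    totals = defaultdict(float)
    for r in records:
        # Assumption: anything not tagged "prod" counts as experimental.
        bucket = "production" if r.get("env") == "prod" else "experimental"
        totals[(r["workload"], bucket)] += r["cost_usd"]
    return dict(totals)

# Hypothetical billing records for illustration.
records = [
    {"workload": "rag-search", "env": "prod", "cost_usd": 1200.0},
    {"workload": "prototype-eval", "env": "dev", "cost_usd": 340.0},
    {"workload": "rag-search", "env": "dev", "cost_usd": 55.0},
]
print(audit_spend(records))
```

Even a rough split like this makes it obvious which line items are production commitments and which are experiments that could move to local hardware.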

Enterprise AI costs surge as HP pitches local hardware

Enterprise GenAI spending reached $37 billion in 2025, with 80% of companies missing cost forecasts by more than 25% (company-reported), according to HP's AI & Data Science Business Development Manager Jerome Gabryszewski. HP claims its on-premises infrastructure delivers up to an 18x cost advantage per million tokens over cloud APIs across a five-year lifecycle (company analysis).

The company positions its ZGX Nano as handling models up to 200 billion parameters locally, while the ZGX Fury targets trillion-parameter inference at the desktop level. HP argues the cost problem is structural: unit inference costs fall but total spending rises because usage growth outpaces cost reductions.

Gartner projects 40% of enterprise applications will embed AI agents by end-2026, up from under 5% a year ago, but only 20% of companies have mature governance models for autonomous AI systems (analyst estimate).

Data sovereignty drives hardware recalculation

The shift reflects deeper architectural tensions beyond pure cost optimization. Gabryszewski frames the core issue as data sovereignty: "Sending proprietary data to a cloud model for processing isn't just an exposure risk, it's a governance failure waiting to happen, especially in regulated industries."

HP's recommended architecture centers on Retrieval-Augmented Generation running locally, keeping proprietary data on-premises while powering AI workflows. This approach addresses compliance requirements that make even transmitting sensitive data externally problematic for regulated industries.
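
The retrieval half of that pattern can be sketched in a few lines; this toy scores documents by word overlap as a stand-in for a real on-premises embedding index, and the document store and query are invented examples.

```python
def retrieve(query, docs, k=2):
    """Rank documents by naive word overlap with the query (embedding stand-in)."""
    q = set(query.lower().split())
    scored = sorted(
        docs,
        key=lambda d: len(q & set(d["text"].lower().split())),
        reverse=True,
    )
    return scored[:k]

# Hypothetical on-prem document store; nothing leaves the local process.
docs = [
    {"id": "policy-1", "text": "internal pricing policy for enterprise contracts"},
    {"id": "hr-2", "text": "holiday schedule and leave policy"},
]
hits = retrieve("enterprise pricing policy", docs, k=1)
print([d["id"] for d in hits])  # → ['policy-1']
```

The point of the architecture is that only the retrieved snippets, not the full corpus, ever reach a model, and in a fully local deployment the model runs on-premises too.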

The company advocates a three-tier model: cloud for burst training and frontier model access, on-premises for predictable high-volume inference, and edge compute for latency-critical applications. This segmentation challenges the cloud-first assumption that dominated early enterprise AI adoption.
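
A hedged sketch of that three-tier routing decision; the thresholds, field names, and tier labels are assumptions for illustration, not HP guidance.

```python
def route(workload):
    """Pick a tier for a workload: edge, cloud, or on-prem (illustrative rules)."""
    # Latency-critical work goes to edge compute (assumed 50 ms cutoff).
    if workload.get("latency_ms_budget", 1000) < 50:
        return "edge"
    # Burst training and frontier-model calls go to the cloud.
    if workload.get("kind") == "training" or workload.get("bursty"):
        return "cloud"
    # Predictable high-volume inference stays on-premises.
    return "on-prem"

print(route({"kind": "inference", "latency_ms_budget": 20}))  # → edge
print(route({"kind": "training"}))                            # → cloud
print(route({"kind": "inference"}))                           # → on-prem
```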

Local compute requires workflow discipline

HP's cost advantage claims rest on distinguishing experimental from production workloads. The company recommends running early iterative work like prototyping and model evaluation on local hardware to avoid burning operational budget on experiments without clear ROI paths.

For teams implementing RAG systems, HP suggests enforcing role-based permissions at the retrieval level, ensuring AI surfaces only information employees are entitled to access. This mirrors existing document management controls while maintaining local data residency.
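
Enforcing permissions at the retrieval layer can be as simple as filtering hits before they reach the model; the document schema and role names below are illustrative assumptions.

```python
def filter_by_role(results, user_roles):
    """Drop any retrieved document the user's roles do not entitle them to see."""
    allowed = set(user_roles)
    return [d for d in results if set(d["allowed_roles"]) & allowed]

# Hypothetical retrieval results with per-document role lists.
results = [
    {"id": "fin-q3", "allowed_roles": ["finance"]},
    {"id": "handbook", "allowed_roles": ["all-staff", "finance"]},
]
print([d["id"] for d in filter_by_role(results, ["all-staff"])])  # → ['handbook']
```

Filtering before generation, rather than asking the model to withhold content, is what makes this mirror existing document management controls: a document the user cannot access never enters the prompt.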

The governance challenge intensifies as IT teams transition from executing tasks to designing and governing AI agents. HP argues local infrastructure provides full observability over agent behavior that cloud-abstracted workloads cannot match, making compliance auditing more straightforward for enterprises with strict regulatory requirements.

#Enterprise AI #RAG #Developer Tools #AI Ethics