Inference
- How MoE Sparsity and Apple Silicon SSD Architecture Make 397B Local Inference Possible
Flash-MoE runs a 397-billion-parameter model on a MacBook Pro with 5.5 GB of active RAM by combining two properties: MoE weight sparsity, which routes each token through only a small subset of experts so only a sliver of the 397B weights is needed at any moment, and Apple Silicon's direct SSD-to-GPU memory architecture, which streams those weights in on demand. This is a specific technical convergence, not a general trick, and understanding why it works on Apple Silicon but not on a standard PC changes how you think about hardware selection for local inference.
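To see why the active footprint can be so much smaller than the total parameter count, it helps to work the arithmetic. The sketch below uses entirely hypothetical numbers (expert count, top-k routing, shared-parameter split, and quantization are assumptions, not the real model's architecture) to show how top-k expert routing shrinks the per-token working set from hundreds of billions of parameters to a few gigabytes:

```python
# Hypothetical MoE sizing sketch -- all figures below are illustrative
# assumptions, not the actual architecture of the 397B model.
TOTAL_PARAMS = 397e9       # 397B total parameters
SHARED_PARAMS = 10e9       # attention, embeddings, router: always resident (assumed)
NUM_EXPERTS = 128          # experts per MoE layer (assumed)
TOP_K = 2                  # experts routed per token (assumed)
BYTES_PER_PARAM = 0.5      # 4-bit quantized weights (assumed)

# Only TOP_K of NUM_EXPERTS experts fire per token, so only that
# fraction of the expert weights must be in memory at once.
expert_params = TOTAL_PARAMS - SHARED_PARAMS
active_expert_params = expert_params * (TOP_K / NUM_EXPERTS)
active_params = SHARED_PARAMS + active_expert_params
active_bytes = active_params * BYTES_PER_PARAM

print(f"total parameters:  {TOTAL_PARAMS / 1e9:.0f}B")
print(f"active parameters: {active_params / 1e9:.1f}B")
print(f"active weight memory: {active_bytes / 1e9:.1f} GB")
```

Under these assumed numbers roughly 96% of the expert weights can stay on SSD at any given moment; the exact active-RAM figure depends on the real model's expert count, routing width, and quantization, plus runtime overhead such as the KV cache.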