Fair, Practical, and Efficient Carbon Accounting for LLM Serving
Yueying Lisa Li, Leo Han, G Edward Suh, Christina Delimitrou, Fiodar Kazhamiaka, Esha Choukse, Rodrigo Fonseca, Liangcheng Yu, Jonathan Mace, and Udit Gupta
SIGMETRICS Perform. Eval. Rev., Aug 2025
We propose a framework for evaluating carbon attribution methods for multi-tenant LLM serving. The framework formalizes the problem using three key components: (1) a set of requests with varying prompt and decode lengths, (2) the LLM inference runtime including batching algorithms, and (3) a carbon emission model accounting for both operational carbon (proportional to power consumption and carbon intensity) and embodied carbon from hardware manufacturing. Using the Shapley value as ground truth for fair attribution, we demonstrate why simple ’leave-one-out’ attribution methods fail to satisfy efficiency properties. The framework evaluates attribution methods against four criteria: scalability (computational complexity), fairness (minimizing deviation from Shapley values), sample efficiency (algorithmic approximations for complex cases), and incentivization (encouraging users to optimize their usage patterns).