Search

Showing top 112 results for "AI cost and tokens"

All sources blogs.nvidia.com 19 wccftech.com 16 developer.nvidia.com 9 tomshardware.com 8 techcrunch.com 8 theregister.com 8 huggingface.co 7 amd.com 5 techpowerup.com 3 theverge.com 2 androidauthority.com 2 engadget.com 2

People also ask

What Are the Factors That Lower Token Cost?

Understanding how to optimize token cost requires looking at the equation for calculating cost per million tokens. In this equation, many enterprises evaluating AI infrastructure focus on the numerator: the cost per GPU per hour. For cloud deployments, this is the hourly rate paid to a cloud provider; for on-premises deployments, it’s the effective hourly cost derived from amortizing owned infrastructure. The real key to reducing token cost, however, lies in the denominator: maximizing the delivered token output. That denominator carries two business implications. Minimize token cost: When thi

Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters

Why Does Cost per Token Matter Much More Than FLOPS per Dollar?

The following data for the DeepSeek-R1 AI model demonstrates the difference between theoretical and actual business outcomes. Looking at compute cost alone, the NVIDIA Blackwell platform appears to cost roughly 2x more than NVIDIA Hopper — but compute cost says nothing about the output that investment buys. An analysis of mere FLOPS per dollar suggests a 2x NVIDIA Blackwell advantage compared with the NVIDIA Hopper architecture. However, the actual outcome is orders of magnitude different: Blackwell delivers more than 50x greater token output per watt than Hopper, resulting in nearly 35x lower

Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters

Videos

Groq's Inference Chips Are Beating NVIDIA's Blackwell by 5x on Cost - And Doing It Twice as Fast

…Towards Cost Per Million Tokens From GPU Per Hour, Says Expert According to the expert, current pricing in the AI infrastructure industry depends on the kind of GPU being used and whether…

Apr 23, 2026 · Ramish Zafar

Amazon Is the Latest Tech Giant to Face the Consequences of AI 'Tokenmaxxing'

…Amazon says it measured token usage to understand cost and efficiency but discouraged using those metrics to measure developer productivity. Companies are pulling back on AI overuse Amazon is one of several…

May 29, 2026 · See full bio

Corporations rein in AI usage, citing high token costs

…action games and gym time. > Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2026 05 > Corporations rein in AI usage, citing high token costs Christopher Harper…

May 31, 2026 · Christopher Harper

AI Factories: The New Infrastructure of Intelligence

…tokens per second, tokens per watt, cost per token, utilization and uptime. In this model, performance per watt translates directly into revenue. Cost per token impacts the economics of every AI factory…

May 27, 2026 · Jeremy Graybill

Retail Archives

…The NVIDIA Blackwell architecture lowered cost per million tokens by 15x versus the previous generation, leading to substantial savings and fostering wider AI deployment and innovation. How Does Blackwell Balance Cost, Throughput…

May 14, 2026

Banking Archives

May 7, 2026

4 sources covering this — show 3 more

‘Pretty Crazy’ Token Usage Is Testing Bosses’ Bet on AI

…into AI tools for coding, marketing, and customer service, a new obsession has emerged in the tech industry: “tokenomics,” or how to manage the soaring cost of AI usage. (Tokens represent the…

Jun 16, 2026 · Paresh Dave

Discussions and forums

Hacker News · u/tinyopsstudio · 4w ago

Followed topics

Search

People also ask

Videos

Groq's Inference Chips Are Beating NVIDIA's Blackwell by 5x on Cost - And Doing It Twice as Fast

Amazon Is the Latest Tech Giant to Face the Consequences of AI 'Tokenmaxxing'

Corporations rein in AI usage, citing high token costs

AI Factories: The New Infrastructure of Intelligence

Top stories

Ditching the cloud for local AI — how I use two mini PCs to process millions of tokens a day and save money on costly API fees

Microsoft Risks Trump's Ire By Abandoning The Costly OpenAI And Anthropic Models For China-Based DeepSeek's V4 Model For Enterprise Workloads

Tensordyne's 3nm Napier AI Chip Promises 13x Higher Token Throughput Than Blackwell & Blazes Past Rubin With 1000 Tokens/s In Multi-Trillion Parameter Models

Amazon Could Turn to Qualcomm's 768GB AI200 Chips as AWS Races to Slash Inference Costs Choking Margins

Retail Archives

Banking Archives

‘Pretty Crazy’ Token Usage Is Testing Bosses’ Bet on AI

Discussions and forums

Show HN: AI agent token cost calculator for Codex and Claude Code loops

Value for Money Is All You Need

Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

Show HN: Token Usage Meter 12 Providers and Coding Agent

Show HN: Open-source CLI to see your AI coding token usage and compare it