News Posts matching #NVIDIA H100

May 8th, 2025 15:06 Discuss (3 Comments)

The Trump administration is preparing to roll back significant portions of the Biden-era rules that govern global chip exports and AI technology transfers. According to Bloomberg's sources familiar with the matter, the so-called "AI diffusion" framework, due to take effect on May 15, will be scrapped in favor of simpler, bilateral licensing agreements. Under the current plan, rather than sorting about 120 countries into three tiers with differing volume caps, the US will negotiate individual contracts with partners like the United Arab Emirates and Saudi Arabia. Even with these changes, restrictions on China's access to advanced chips will stay firmly in place, and may even be reinforced. The proposed regulations are expected to maintain the outright ban on shipments to China, Russia, Iran, and North Korea, while adding stricter oversight for nations that have previously rerouted US-origin semiconductors toward Beijing's AI and military programs. US officials are also considering lowering the notification threshold for smaller shipments, from 1,700 NVIDIA H100 equivalent units down to around 500, to close loopholes used by alleged smuggling networks.

Industry reaction has been mixed but largely positive. Chipmakers saw their share prices climb when news of the repeal broke, citing hopes for clearer rules and fewer compliance headaches. Governments in Southeast Asia and Eastern Europe are watching closely and urging Washington to provide detailed guidance during the transition to avoid market disruptions. The AI diffusion rule was introduced in January 2025 with the goal of drawing countries such as India, Malaysia, and Poland into a more stringent export regime. Critics have argued that its complex, tiered system stifled innovation and diplomatic flexibility. The incoming framework will instead rely on targeted, outcome-driven accords that tie access to strategic investments and broader trade incentives. An official announcement could come as soon as Thursday, just before President Trump's trip to the Middle East. Final details are expected to be released in the coming weeks, marking a new chapter in US semiconductor diplomacy.

NVIDIA Dismisses Anthropic's Report of Ludicrous GPU & CPU Smuggling Methods

T0@st

May 2nd, 2025 18:25 Discuss (16 Comments)

The first couple of paragraphs within Anthropic's "Securing America's Compute Advantage: (Our) Position on the Diffusion Rule" article are standard fare. Roughly half-way through a read of this policy-related piece, the North American (Amazon-backed) AI startup makes some bizarre claims about the smuggling of AI-oriented products into China. Given ongoing global tensions and growing industry demands, these activities are somewhat expected—but Anthropic leadership described very specific methodologies. As stated within their "Chip Smuggling is a Major Threat" passage: "China has established sophisticated smuggling operations, with documented cases involving hundreds of millions of dollars worth of chips. In some cases, smugglers have employed creative methods to circumvent export controls, including hiding processors in prosthetic baby bumps and packing GPUs alongside live lobsters." Specific bits of hardware were not mentioned in this section, but the author later alludes to the frictionless transfer of thousands of "NVIDIA H100 advanced chips" into Chinese territories.

In a statement issued to CNBC, a Team Green spokesperson dismissed Anthropic's fanciful claims: "American firms should focus on innovation and rise to the challenge, rather than tell tall tales that large, heavy, and sensitive electronics are somehow smuggled in 'baby bumps' or 'alongside' live lobsters." This very public spat has received mainstream attention; with further coverage documenting additional "to and fro" barbs. NVIDIA criticized Anthropic's anti-foreign competition stance: "China, with half of the world's AI researchers, has highly capable AI experts at every layer of the AI stack. America cannot manipulate regulators to capture victory in AI." Amusingly, Anthropic's operations rely heavily on Team Green hardware—many online critics reckon that top US AI companies are jostling for priority access to cutting-edge GPUs/accelerators. In reaction to NVIDIA's dismissal of their report, a company spokesperson retorted with: "Anthropic stands by its recently filed public submission in support of strong and balanced export controls that help secure America's lead in infrastructure development and ensure that the values of freedom and democracy shape the future of AI."

US to Implement Bilateral Licensing Framework for AI Chips

AleksandarK

May 1st, 2025 11:59 Discuss (0 Comments)

The Trump administration is preparing substantial changes to the Biden-era Framework for AI Diffusion controlling advanced semiconductor exports. Sources close to the Reuters indicate officials will replace the current three-tier country classification with a unified government-to-government licensing system requiring bilateral approval for US chip acquisitions. The existing framework, implemented in January 2025, permits unrestricted exports to 17 allied nations plus Taiwan, imposes volume caps on roughly 120 countries and blocks shipments to China, Russia, Iran, and North Korea. Current regulations exempt orders below 1,700 NVIDIA H100 equivalent units from full licensing requirements, needing only a notification.

Former Commerce Secretary Wilbur Ross, acting as an informal adviser, verified that bilateral government agreements are under review. Officials are also considering reducing the notification threshold from 1,700 to approximately 500 H100 equivalents to address circumvention concerns. The proposal has drawn criticism from industry figures, including Oracle Executive VP Ken Glueck and a coalition of seven Republican senators who have urged Commerce Secretary Howard Lutnick to withdraw the existing framework entirely. The administration faces pressure to finalize regulations before the May 15 compliance deadline, balancing security objectives with trade considerations. An announcement is expected before the month's end.

MangoBoost Achieves Record-Breaking MLPerf Inference v5.0 Results with AMD Instinct MI300X

Press Release by

T0@st

Apr 4th, 2025 20:55 Discuss (0 Comments)

MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, has set a new industry benchmark with its latest MLPerf Inference v5.0 submission. The company's Mango LLMBoost AI Enterprise MLOps software has demonstrated unparalleled performance on AMD Instinct MI300X GPUs, delivering the highest-ever recorded results for Llama2-70B in the offline inference category. This milestone marks the first-ever multi-node MLPerf inference result on AMD Instinct MI300X GPUs. By harnessing the power of 32 MI300X GPUs across four server nodes, Mango LLMBoost has surpassed all previous MLPerf inference results, including those from competitors using NVIDIA H100 GPUs.

Unmatched Performance and Cost Efficiency
MangoBoost's MLPerf submission demonstrates a 24% performance advantage over the best-published MLPerf result from Juniper Networks utilizing 32 NVIDIA H100 GPUs. Mango LLMBoost achieved 103,182 tokens per second (TPS) in the offline scenario and 93,039 TPS in the server scenario on AMD MI300X GPUs, outperforming the previous best result of 82,749 TPS on NVIDIA H100 GPUs. In addition to superior performance, Mango LLMBoost + MI300X offers significant cost advantages. With AMD MI300X GPUs priced between $15,000 and $17,000—compared to the $32,000-$40,000 cost of NVIDIA H100 GPUs (source: Tom's Hardware—H100 vs. MI300X Pricing)—Mango LLMBoost delivers up to 62% cost savings while maintaining industry-leading inference throughput.

News Posts matching #NVIDIA H100

Latest GPU Drivers

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts