The post NVIDIA weighs Groq as Samsung 3nm yields in focus appeared on BitcoinEthereumNews.com. NVIDIA Groq inference chip shifts decode to LPUs to improve latencyThe post NVIDIA weighs Groq as Samsung 3nm yields in focus appeared on BitcoinEthereumNews.com. NVIDIA Groq inference chip shifts decode to LPUs to improve latency

NVIDIA weighs Groq as Samsung 3nm yields in focus

For feedback or concerns regarding this content, please contact us at [email protected]

NVIDIA Groq inference chip shifts decode to LPUs to improve latency

NVIDIA is previewing an inference chip that integrates Groq technology to offload token-by-token decode onto low-latency processing units while leaving training on GPUs. according to Tom’s Hardware, corporate statements describe integrating Groq’s processors into the NVIDIA AI Factory architecture to expand coverage for real-time inference.

This design aligns with an industry shift that separates the prefill phase from decode in large-model inference. as reported by VentureBeat, the split enables specialized hardware to target latency-critical decode while GPUs handle bulk prefill compute.

Why it matters: prefill vs decode, cost and energy

Placing prefill on GPUs and decode on LPUs is intended to cut user-perceived latency and smooth tail behavior under load. DA Davidson notes that Groq-style designs can face memory-capacity limits, so gains may vary across model sizes and concurrency profiles.

Analysts frame this as an inference-share play where latency and efficiency drive unit economics at scale. “NVIDIA can take even greater share of the inference market,” said CJ Muse, Senior Managing Director at Cantor Fitzgerald, emphasizing both offensive and defensive motives.

Inference costs increasingly dominate total AI spend as usage scales. WisdomAI reports that this moves buyer focus from peak FLOPS toward cost per token and energy per query, especially for high-volume consumer and enterprise assistants.

OpenAI is widely reported, but not officially confirmed in detail, as a potential first production-scale user of NVIDIA’s Groq-based inference chip. According to AIwire, this would reflect a hedging strategy to secure lower-latency, lower-cost inference capacity.

Production risk may hinge on Samsung’s leading-edge process readiness if it handles first foundry builds. PhoneArena reports persistent low yields in Samsung’s 3 nm and 2 nm nodes relative to TSMC, a factor that could influence client confidence and delivery timing.

Supply chain and inference unit economics outlook

Samsung Foundry production readiness and client confidence versus TSMC

Client caution remains elevated at the leading edge. As reported by EE Times, some fabless customers are favoring TSMC due to concerns about Samsung’s yields and delivery reliability.

Samsung has responded with leadership moves focused on defect analysis and metrology to improve 3 nm and 2 nm yields. Biz Chosun reports these changes, while En. Sedaily adds that Tesla’s AI5 volume may be split between Samsung and TSMC, signaling conditional confidence if yields stabilize.

Latency, cost per token, and energy per query at scale

Separating prefill from decode provides a placement framework: keep bandwidth-heavy, sequence-initialization work on GPUs, and move token-generation loops to LPUs where serialization dominates. Bernstein has highlighted this bifurcation as the core architectural trend in inference.

The expected outcome is lower tail latency and improved energy-per-query, with cost gains accruing where decode dominates runtime. WisdomAI notes that as inference volumes outgrow training, these unit economics become decisive for platform competitiveness.

FAQ about NVIDIA Groq inference chip

Is OpenAI confirmed as the first customer for NVIDIA’s Groq-based inference chip and what advantages would it gain?

OpenAI is not officially confirmed. Reports indicate it could gain lower latency and better unit economics if decode shifts to LPUs.

How do prefill vs decode stages map to GPUs vs LPUs, and which models or workloads benefit most?

GPUs handle prefill; LPUs target decode. Latency-sensitive assistants and streaming token generation benefit most, subject to memory and model-size constraints.

Source: https://coincu.com/news/nvidia-weighs-groq-as-samsung-3nm-yields-in-focus/

Market Opportunity
Overtake Logo
Overtake Price(TAKE)
$0.02191
$0.02191$0.02191
-0.18%
USD
Overtake (TAKE) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

DBS Tests Repo With Ripple RLUSD and Franklin sgBENJI

DBS Tests Repo With Ripple RLUSD and Franklin sgBENJI

The post DBS Tests Repo With Ripple RLUSD and Franklin sgBENJI appeared on BitcoinEthereumNews.com. Ripple, DBS, and Franklin Templeton launch tokenized repo pilot on DBS Exchange. Repo trades use Ripple’s RLUSD stablecoin and Franklin Templeton’s sgBENJI token. sgBENJI issued on XRP Ledger enables fast collateralized lending and settlements. DBS, Ripple, and Franklin Templeton have signed a memorandum of understanding to bring repo transactions into tokenized finance. The framework pairs Ripple’s RLUSD stablecoin with Franklin Templeton’s sgBENJI tokenized money market fund, listed on DBS Digital Exchange. The setup gives accredited clients a path to rebalance cash into a regulated, yield-bearing vehicle while transacting with stablecoins that settle within minutes. For institutions used to overnight repo desks, this is a first look at how traditional liquidity tools can migrate onto public blockchains. Related: Franklin Templeton Launches its DeFi Solution Benji on Ethereum Demand From Institutions Shapes the Design The three firms cited rising demand for digital asset allocations, with surveys showing nearly nine in ten institutional investors plan to increase exposure in 2025. The repo model was chosen because it mirrors an existing backbone of global funding markets: collateralized lending against short-term securities. By allowing RLUSD to trade directly against sgBENJI on DBS Digital Exchange, desks can manage intraday liquidity, park stablecoin reserves into a fund earning regulated yield, and unwind positions quickly when cash is needed. DBS to Expand Collateralized Lending The next phase extends sgBENJI beyond a trading instrument into repo collateral. DBS plans to let investors pledge sgBENJI against credit lines arranged through the bank or third-party lenders. That opens deeper liquidity pools with the assurance that collateral sits inside a regulated balance sheet. For trading desks, that means onchain repo could eventually function like its traditional counterpart, rolling positions overnight, secured by tokenized assets that settle in near real-time. XRP Ledger as the Settlement Rail Franklin Templeton will issue sgBENJI tokens on…
Share
BitcoinEthereumNews2025/09/18 20:25
Pepeto Attracts Capital As Early Shiba Inu And Pepe Investors Hunt Big Gains And The Next 100x Story

Pepeto Attracts Capital As Early Shiba Inu And Pepe Investors Hunt Big Gains And The Next 100x Story

The post Pepeto Attracts Capital As Early Shiba Inu And Pepe Investors Hunt Big Gains And The Next 100x Story appeared first on Coinpedia Fintech News Early Shiba Inu and PEPE stories are legendary. Some first movers turned $1,000 into well over $1,000,000 as SHIB ran more than 26,000% in 2021, while PEPE delivered multi-thousand % bursts for the earliest entries. After riding those arcs, many of those holders are hunting the next big move, shifting from SHIB to PEPE and …
Share
CoinPedia2025/09/18 19:02
A 3821% surge in 20 years: Why are Pokémon cards valuable investments?

A 3821% surge in 20 years: Why are Pokémon cards valuable investments?

By David Unyime Nkanta Compiled by: TechFlow The Pokémon trading card game is extremely popular around the world, especially in Japan. These cards are very valuable, especially the rare ones. (Image source: Twitter / FADA Pack Magic @FadaPackMagic) Pokémon trading cards have gone from amusement park items to one of the world's hottest alternative investments. According to data from analytics firm Card Ladder, the Pokémon card market has grown 3,821% in value since 2004, far outpacing the S&P 500's 483% increase and Meta Platforms' 1,844% growth. From hobby to high-yield asset Pokémon trading cards, launched by Nintendo in 1996, have become a popular investment, traded across platforms including eBay, TCGplayer, and international expos. The market has seen explosive growth during the pandemic, as stimulus policies and lockdowns have driven collectors toward alternative assets. For some, the investment has yielded life-changing returns. Lucas Shaw, a 27-year-old account manager in Ohio, said the profits from selling the cards helped him pay for his wedding rings and celebrations. Similarly, Justin Wilson, a 32-year-old advertising manager in Oklahoma City, estimates the total value of his collection of 500 cards and 100 sealed items at about $100,000. He considers Pokémon cards part of his investment portfolio, alongside his Roth IRA and securities accounts. The appeal of Pokémon cards lies not only in financial gain but also in their emotional resonance. "You have to collect them all," Wilson said, referencing the series's classic slogan. For many, the cards represent both childhood nostalgia and speculative opportunity. Where does the value of rare Pokémon cards come from? A classic Poké Ball toy with matching Pokémon trading cards. Zapdos, Ninetales, and a trainer card are clearly visible. Image credit: Thimo Pedersen/Unsplash Unlike stocks, Pokémon cards don't generate dividends; their value depends on their rarity, condition, and cultural significance. Cards graded as perfect PSA 10 by the Professional Sports Authenticator (PSA) often fetch exorbitant prices. The most dramatic example occurred in 2022, when influencer Logan Paul purchased a near-perfect "Pikachu Illustrator" card for $5.3 million, setting a Guinness World Record for the most expensive Pokémon card ever sold privately. This event further ignited market interest and highlighted the speculative potential of high-level cards. Risks of the Pokémon Card Market Financial advisors warn against considering collectibles as the core of a portfolio. Card prices are extremely volatile, influenced by hype, media coverage, and collector sentiment. Counterfeit cards also remain a potential threat, with scams frequently occurring. Image source: Flickr/c0rnnibblets Still, the resilience of the Pokémon brand provides some stability to the market. Pokémon spans video games, movies, and merchandise, and unlike sports trading cards, the characters are immune to scandals, making them a safer investment for some collectors. The Future of Collectibles Investing The rapid rise of Pokémon cards reflects a broader shift in people's perception of value. As digital assets like Bitcoin face regulatory scrutiny and tech stocks undergo a market correction, tangible collectibles offer a nostalgic and potentially profitable haven. While the sustainability of its value remains uncertain, the 3,821% growth over the past 20 years has established Pokémon trading cards as the most vivid example of how a childhood hobby can transform into a multi-million dollar investment.
Share
PANews2025/09/18 18:00