Semiconductor News & Analysis Feed
69 articles
2026-06-16
developer.nvidia.com
2026-06-16
NVIDIA Developer
Mixture-of-experts (MoE) models have quickly become a foundational component of modern, large-scale AI systems. They are widely adopted because they enable substantially larger model capacity while activating only a subset of parameters for each token, offering an unparalleled approach for scaling performance within a practical compute budget. As model scales continue to grow, the optimization of
2026-06-15
www.tradingview.com
2026-06-15
TradingView
News
/
Reuters
/
Sunrise Energy Metals Agrees To Strategic Investment In Advanced Alscn Memory Semiconductor Developer Agni Semiconductor
Sunrise Energy Metals Agrees To Strategic Investment In Advanced Alscn Memory Semiconductor Developer Agni Semiconductor
RefinitivLess than 1 min read
SRL
−3.82%
© Copyright Thomson Reuters 2026. Click For Restrictions - https://agency.reuters.com/en/copyright.h
2026-06-14
tomshardware.com
2026-06-14
Denise Bertacchi
Snapmaker celebrates 10 years in business by sponsoring open-source developers and you.
2026-06-13
tomshardware.com
2026-06-13
Kunal Khullar
Powered by the Ryzen AI Max+ 395 processor and 128GB of unified memory, AMD's developer kit arrives as a direct competitor to Nvidia's DGX Spark, which recently saw a price increase to $4,699.
2026-06-13
digitimes.com
2026-06-13
India has expanded exemptions from mandatory quality certification requirements for imports by Special Economic Zone (SEZ) units and developers, a policy change that industry observers say could ease the establishment of semiconductor manufacturing facilities in the country.
2026-06-13
developer.nvidia.com
2026-06-13
NVIDIA Developer
AI agents have fundamentally changed the complexity of inference workloads. Until now, the industry has struggled to define a standard for measuring how inference systems perform under these conditions. Artificial Analysis AgentPerf (AA-AgentPerf) offers the industry’s first multi-vendor open benchmarks profiling trajectories that are representative of real-world AI agent coding tasks.
This post
2026-06-12
news.google.com
2026-06-12
Phoronix
2026-06-12
news.google.com
2026-06-12
NVIDIA Developer
2026-06-12
developer.nvidia.com
2026-06-12
NVIDIA Developer
NVIDIA Quantum InfiniBand now offers intent-based security profiles in Unified Fabric Manager (UFM) that enable multi-tenant fabric security in a single click.
NVIDIA Quantum InfiniBand supports three profiles: General, Bare Metal Cloud, and Secured Bare Metal Cloud. Network administrators can now auto-configure:
Partition Key (PKey) isolation
Management Datagram (MAD) key protection
Global Uni
2026-06-11
developer.nvidia.com
2026-06-11
NVIDIA Developer
Developers building real-time AI—such as chat assistants, copilots, and agentic workflows—are often constrained by token-by-token generation speed. This limits responsiveness, increases serving costs, and makes fluid, interactive experiences difficult to achieve.
DiffusionGemma, created by Google DeepMind and optimized to run efficiently across NVIDIA platforms, introduces a new approach to tex
2026-06-10
developer.nvidia.com
2026-06-10
NVIDIA Developer
AI factories are changing what data-center infrastructure must do.
Unlike traditional data centers, AI factories are built to manufacture intelligence at scale. They run power-dense training and inference workloads, increasingly support agentic and reasoning models, and must deliver predictable performance even as compute demand shifts rapidly. In this environment, electrical infrastructure is no
2026-06-10
developer.nvidia.com
2026-06-10
NVIDIA Developer
As AI infrastructure scales, enterprise expectations for operational maturity are increasing. Organizations expect these systems to be provisionable, observable, secure, and manageable at scale—the same standard applied to all critical infrastructure. The moment an AI system moves from development into enterprise deployment, that operational foundation is essential.
NVIDIA DGX Spark and NVIDIA GB
2026-06-10
developer.nvidia.com
2026-06-10
NVIDIA Developer
Converting a quantized checkpoint into an NVIDIA TensorRT engine bridges the gap between model optimization and production deployment, enabling faster inference, higher throughput, and more efficient GPU utilization at scale.
In a previous post, we produced a high-quality FP8-quantized Contrastive Language-Image Pretraining (CLIP) checkpoint with NVIDIA TensorRT Model Optimizer.
This post picks
2026-06-10
developer.nvidia.com
2026-06-10
NVIDIA Developer
Federated learning (FL) research often begins with a deceptively simple question: What should we try next? A new aggregation rule, a FedProx coefficient, a server optimizer setting, a SCAFFOLD variant, or a model architecture tweak may all look promising before an experiment starts.
After the run finishes, the harder questions begin: Did the change actually improve the metric? Was the comparison
2026-06-09
tomshardware.com
2026-06-09
Aaron Klotz
Linux developer uses AI to help update Linux GPU driver support for vintage HD 2000 - HD 6000 series.
2026-06-09
digitimes.com
2026-06-09
Computex 2026 has ended, with the spotlight again firmly on Nvidia CEO Jensen Huang. From his arrival in Taiwan on May 23, Huang spent two weeks meeting key industry figures, attending Nvidia developer events, GTC Taipei, and a Computex tour, and once again hosting his "trillion-dollar banquet."
2026-06-09
developer.nvidia.com
2026-06-09
NVIDIA Developer
Pre-training frontier LLMs comes down to throughput. When training spans trillions of tokens across thousands of accelerators, every percentage point of step time can add up to days of training and substantial compute costs. Numerical precision is one of the highest-leverage knobs available, but low- bit mixed-precision pretraining is hard to get right.
To address this, the NVFP4 training recipe
2026-06-09
tomshardware.com
2026-06-09
Mark Tyson
Texas farmland originally donated in 1999 to be used only as a public park has been sold to a data center developer for $10 million.
2026-06-08
digitimes.com
2026-06-08
Apple's M5 Pro signals a broader shift in laptop processors, with implications for global device makers, developers, and AI users. A teardown suggests Apple is combining chiplet-style packaging, higher memory bandwidth, and GPU-based AI acceleration to strengthen on-device computing while reshaping how premium PCs approach local AI workloads.
2026-06-08
moneywise.com
2026-06-08
moneywise.com
Em Norton
Jun 7, 2026
Jensen Huang, CEO of Nvidia, delivered a keynote speech at Computex — the leading global exhibition focused on AIoT and startups — on Monday. The exhibition’s theme was “AI Together” and Huang’s speech focused on agentic and physical AI, and what it means for the future.
According to Fortune Business Insights, the agentic AI market is currently valued at over $9 billion and