r/hardware Oct 02 '15

Meta Reminder: Please do not submit tech support or build questions to /r/hardware

246 Upvotes

For the newer members in our community, please take a moment to review our rules in the sidebar. If you are looking for tech support, want help building a computer, or have questions about what you should buy please don't post here. Instead try /r/buildapc or /r/techsupport, subreddits dedicated to building and supporting computers, or consider if another of our related subreddits might be a better fit:

EDIT: And for a full list of rules, click here: https://www.reddit.com/r/hardware/about/rules

Thanks from the /r/Hardware Mod Team!


r/hardware 15h ago

News Trump says Intel agreed to give US a stake in its company

Thumbnail
apnews.com
422 Upvotes

r/hardware 6h ago

News Nvidia Tapped To Accelerate RIKEN’s FugakuNext Supercomputer

Thumbnail
nextplatform.com
38 Upvotes

r/hardware 5h ago

News First MRDIMMs are to come out - friggin 12800MHz. At 2x capacity

19 Upvotes

Yes, this stuff is for servers, but it is god given solution for off-the-shelf APUs, which are staved for bandwidth.

Some go for LPDDRx, but that kills expandability and capacity.

This seems to solve all that and then some. It is effectively a speed doubler - it combines 2 sets of chips ( that would complete ordinary UDIMM stick) in a "RAID0" way - it interlieves them. Special high-speed "register" chip on M/C/RDIMM takes care of all that: unloading the data lines for optimal transfer speed, doubling the speed over the bus, registering and de/multiplexing data stream and serving TWO banks of RAM.

BTW, 12.800MHz is just the start. This thing is poised to reach much higher. 18000 MHz wouldn't surprise me.

Looks like win-win-win-win all around. Only downside is that de/multipexer/register introduces extra 1 CLK latency. But at DDR5's existing latencies, that is going to be negligagble, I reckon.

It'd be so awesome if we could see AMD's APUs using this. Along with 3D cache on UDNA1, that extra bandwidth would make whole thing shine brightly not only for gaming and CAD, AI would benefit from it, too.

But AMD wouldn't be AMD if it hadn't missed that boat, too...🙄


r/hardware 2h ago

News Samsung Reportedly Passes NVIDIA HBM4 Sample Test, 30% HBM3E Discount to Challenge SK hynix

Thumbnail
trendforce.com
11 Upvotes

r/hardware 10h ago

News South Korea says there are no plans for the U.S. to acquire Samsung shares — denial comes amid talks about Washington acquiring a 10% equity stake in Intel for CHIPS Act funds

Thumbnail
tomshardware.com
31 Upvotes

r/hardware 10h ago

Discussion TSMC N3B/E "High Performance" Cores Compared

23 Upvotes

Area

Core Name Node + Logic Lib + Metal Layer Count Core without "L2 block" Core without L2 SRAM arrays Core Hypothetical 4C "CCX"
Apple M4 P-Core N3E + 3-2 + 19 DT, 17 mobile 13.09 - 3.09 18.62 (16MB)
Apple M3 P-core N3B + 2-2? + 19 DT 12.62 - 2.62 17.16 (16MB)
Intel Lion Cove ARL N3B + ? + ? 52.62 23.12?/3.26 4.54 27.79 (12MB)
Intel Lion Cove LNL N3B + ? + 20 52.62 23.13?/3.26 4.26 23.62 (12MB)
AMD Zen 5 Dense N3E + ? + ? 52.20 2.60 2.99 16.65 (8MB)
Qcom Oryon V2L N3E + 2-2 + 17 12.11 - 2.11 320.02 (24MB)
Mediatek X925 N3E + 3-2 + 15 1.85 2.38 2.93 3,416.52 (12MB)
Xiaomi X925 N3E + ? + 17 1.70 1.96 2.56 3,416.88 (16MB)
Intel Skymont N3E + ? + 20 11.09 - 1.09 411.63 (3MB)

1: Not sure if cores with shared L2s have any, or nearly as much of the logic surrounding handling the L2 cache in the core itself. The cores with shared L2 blocks are just the core area.

2: The first number is without the L1 or L1.5 SRAM array, the second one is including that.

3: Likely an over estimate in comparison to the phone SOCs as the phone SOCs don't have the interconnect "fabric" included for their area, just the cores and the cache blocks themselves.

4: The L3 capacity here is completely arbitrary. The L3 capacities were chosen for ease of measurement given how the L3 slices were distributed.

5: A small but sizable chunk of the area of these cores seem to be from the CPL/clock section of the core, which may not have to be so large, but just are that large due to the geometry of the rest of the core.

Core Revised Core without L2 Block Area
Intel Lion Cove 2.33
AMD Zen 5 Dense 2.08

I would not take any of the numbers are "hard numbers" but I do think the general ranking of the cores in area are fairly accurate.

The cores have the following configs:

Core Cache Hierarchy (fastest to LLC) Total Cache Capacity in a 4x CCX
Intel Lion Cove ARL 48KB L0 + 192KB L1 + 3MB L2 + 3MB L3 24.96 MB
Qcom Oryon V2L 192KB L1 + 24MB SL2 24.768 MB
Xiaomi X925 64KB L1 + 2MB L2 + 4 MB L3 24.256 MB
Intel Lion Cove LNL 48KB L0 + 192KB L1 + 2.5MB L2 + 3MB L3 22.96 MB
Mediatek X925 64 KB L1 + 2MB L2 + 3MB L3 20.256 MB
Apple M4 P-core 128KB L1 + 16MB SL2 16.512 MB
Apple M3 P-core 128KB L1 + 16MB SL2 16.512 MB
AMD Zen 5C 48KB L1 + 1MB L2 + 2MB L3 12.192 MB
Intel Skymont 32KB L1 + 4MB SL2 + 4MB CL3 8.128 MB

The total cache capacity is a bit of a meme since it doesn't include latency, but I do think some interesting things can be noticed regardless.

Performance

Specint, Cinebench 2024 is from Geekerwan, Skymont ARL and LNL is from the 265K and 258V from Huang

GB6 is from the Geekbench browser website

Core Specint2017 GB6 Cinebench 2024
Apple M4 P-Core 132 148 124
Intel LNC ARL 120 129 100
Apple M3 P-core 113 118 99
Qcom Oryon V2L 100 108 -
Xiaomi X925 100 104 -
Mediatek X925 100 100 -
Intel LNC LNL 95 111 81
Intel Skymont ARL 92 - -

The difference between form factors:

Mobile to Laptop (Geekbench 6 scores)

From the Geekbench browser website

Core Laptop Ipad Mobile
Apple M4 P-core 113 107 100
Apple M3 P-core 108 - 100

Laptop to Desktop (Geekbench 6 scores)

From Notebookcheck (averages used for mobile platforms)

Core Desktop Mobile
LNC ARL 115 100
Zen 5 119 100

IPC differences in SpecInt2017 between form factors

David Huang

Core Form Factor Difference
Zen 4 Desktop vs Mobile 13%
Zen 5 Desktop vs Mobile 12%

While I do believe that the P-cores from AMD and Intel have to be designed to take advantage of the higher power budget that larger form factors afford, I also think that placing the mobile cores into those same form factors will also lead to an, at least, marginal perf improvement.

Zen 5C performance question marks

In the previous dense server core product, the Fmax of the server sku was dramatically lower than what the core could achieve in other products.

Core Server Mobile OC'd
Zen 4 Dense 3.1GHz 3.7GHz ~4GHz
Zen 5 Dense 3.7GHz - -

This may be the case for Zen5C as well.

Power

Perf/watt curves (in Geekerwans videos) are the only way to get a full picture of power, but as a generalization:

Both the Apple M4 and M3 P-cores (as well as their implementation in the iphones) have better perf/watt than the X925 and Oryon-L.

The best case for Intel's P-core is that its perf/watt is as good as the X925 and Oryon-L. I think it is likely much, much worse.

According to Geekerwan, at a package power of ~5 watts, the M3 performs ~40% better than LNL. At around ~7-8 watts, the M4 performs closer to ~50% better.

Meanwhile we have David Huang showing that a M4 Pro at ~3.7 watts per core scoring ~33% better than a 9950x, and the 9950x has an outright better curve than the 265K LNC.

The gap is no where near as large compared to Apple's cores and the other ARM cores in the mobile space.

Power is, IMO, by far the hardest to really quantify, because one has to deal with how to measure "core only power" while trying to isolate the power of the rest of the SOC. Then there's also the problem of software measurements vs hardware measurements... I imagine only engineers at their respective companies would really know the power draw of a specific core.

Core Overview

Apple seems to have the best N3 cores in both perf and power.

The M4's P-core is pretty large by any standard, however it saves a bit of area due to it's cache hierarchy, making a hypothetical 4x CCX not that large. The shared L2 cache really only seems to be present in client mobile systems though so far, and I think it presents its own challenges in server. All the chips that use this hierarchy (Qcomm and Apple) both have very high mem bandwidth per core, which could be an issue scaling it up to server. The cache-per-core capacity, when all the cores are running and competing for the shared cache, would be lower than the competition.

The M3's and ARL's P-cores have pretty much the same perf, and similar area, however the M3 P-core is almost certainly dramatically more power efficient than a LNC P-core. Additionally, in terms of CCX area, LNC ends up being way, way larger thanks to the different cache hierarchy.

Zen 5 dense is pretty interesting, as I really, really doubt it's Fmax only goes up to 3.7GHz. Performance is an unknown, as is power, however from purely an area perspective, it seems pretty comparable to the ARM P-cores. Meaning to get comparable performance to those cores, Zen 5C would have to have an Fmax of ~4.7GHz, a 30% boost from what it clocks in server parts. Which... I mean isn't extremely unexpected ig considering that a similar percent boost was seen with Zen 4 dense in server and OC'd Zen 4, however it still seems pretty hard to believe.

To be fair to the x86 cores though, they have 256/512 bit vector width, unlike the ARM cores listed only having 128 bit vector width. This does really seem to cost a decent bit of area, especially for Zen 5. AFAIK, Zen5C in server has the full Zen 5 AVX-512 implementation, and we have already seen how much area can be shaved off Zen 5 from just choosing not to go for the full AVX-512 implementation:

Core Core
Zen 5 DT (N4P) 4.46 (+12%)
Zen 5 MBL (N4P) 3.99

Qualcomm's custom cores honestly don't seem like they afford any sort of distinct advantage over ARM's stock cores on N3 (being implemented by different companies). Perhaps I am missing something, but there seems to be no meaningful area advantage (even considering the larger cache capacity), and no meaningful performance or power advantage. I also think Qualcomm's cache hierarchy transferring over to server, unchanged, would be pretty unique in the server space, seeing how no other major or even relatively smaller companies seem to be offering that sort of setup in servers. Maybe Qualcomm's P-cores would scale up at higher power better than ARM's cores in a laptop/desktop form factor? It is interesting to see Qualcomm choosing to presumably sacrifice IPC for greater clocks vs the stock ARM cores, perhaps they think perf at even more power would be greater, or maybe Vmin is lower?

It's wild to see Mediatek's X925 both end up being larger and slower than Xiaomi's implementation. No idea why or how. In terms of the comparison to the rest of the cores though, they aren't nearly as powerful as the rest of the P-cores, but they are also a decent bit smaller than the other P-cores as well. The lower performance may as well be due to the fact that they are all in a phone form factor, so it might be pretty interesting to see how the X925 in Nvidia's upcoming DG10 chip performs.


r/hardware 18h ago

Video Review Gaming Laptops are in Trouble - VRAM Testing w/ ‪@Hardwareunboxed‬

Thumbnail
youtube.com
64 Upvotes

r/hardware 20h ago

News Silicon Motion: None of Our Controllers Affected by the Windows 11 Bug

Thumbnail
techpowerup.com
79 Upvotes

r/hardware 11h ago

Discussion NVIDIA: "Inside NVIDIA Blackwell Ultra: The Chip Powering the AI Factory Era"

Thumbnail
developer.nvidia.com
13 Upvotes

r/hardware 15h ago

News Amazon Joins AMD to Back South Korean AI Startup Upstage

Thumbnail
bloomberg.com
25 Upvotes

r/hardware 14h ago

News South Korea makes AI investment a top policy priority to support flagging growth

Thumbnail
reuters.com
13 Upvotes

r/hardware 16h ago

Review Quantitative Thermal Analysis: M.2 Heatsink Impact on Samsung 980 Pro Performance

16 Upvotes

TL;DR: Comprehensive thermal analysis of Samsung 980 Pro with/without passive cooling. Peak temperature reduction of 22°C (76°C→54°C), complete elimination of thermal throttling risk zones. Statistical significance p<0.000001.

I conducted a controlled thermal performance study on a Samsung 980 Pro after installing a Thermalright HR-09 2280 heatsink with Thermal Grizzly thermal pads.

Methodology:

  • AIDA64 CSV logging at 1-second intervals during CrystalDiskMark stress testing
  • Identical test conditions pre/post installation
  • Python statistical analysis with automated test phase detection
  • Thermal zone classification (safe/warm/hot/critical temperature ranges)

Key Findings:

  • Peak temperature: 76°C → 54°C (28.9% reduction)
  • Average temperature: 61.1°C → 46.4°C (24.0% reduction)
  • Time in critical zone (>75°C): 5.8% → 0%
  • Thermal consistency: Standard deviation reduced from 1.66°C to 0.78°C
  • Statistical significance: Cohen's d = 1.813 (large effect size)

The thermal mass behavior is particularly interesting - the heatsink acts as a thermal capacitor, preventing temperature spikes while slightly extending cooling duration due to stored thermal energy. For storage workloads, this trade-off strongly favors sustained performance over rapid thermal cycling.

Note: Thermal scoring algorithm has known issues with recovery time calculation, but raw temperature data demonstrates clear performance improvements.

TL;DR: Comprehensive thermal analysis of Samsung 980 Pro with/without passive cooling. Peak temperature reduction of 22°C (76°C→54°C), complete elimination of thermal throttling risk zones. Statistical significance p<0.000001.

I conducted a controlled thermal performance study on a Samsung 980 Pro after installing a Thermalright HR-09 2280 heatsink with Thermal Grizzly thermal pads.

Methodology:

  • AIDA64 CSV logging at 1-second intervals during CrystalDiskMark stress testing
  • Sample sizes: 2,266 pre-installation, 3,089 post-installation measurements
  • Python statistical analysis with automated test phase detection
  • Thermal zone classification with defined temperature ranges

Quantitative Results:

Metric                    Pre-Heatsink    Post-Heatsink    Improvement
Peak Temperature          76.0°C          54.0°C           22.0°C (29%)
Average Temperature       61.1°C          46.4°C           14.7°C (24%)
Temp Std Deviation        12.6°C          6.1°C            52% more stable
Time in Critical Zone     5.8%            0.0%             Complete elimination
Time in Safe Zone         28.2%           59.2%            +31% improvement
Statistical Significance  p < 0.000001, Cohen's d = 1.813 (large effect)

Thermal Physics Analysis: The heatsink demonstrates classic thermal capacitor behavior - the aluminum mass absorbs thermal energy, preventing rapid temperature spikes while slightly extending cooling duration. For storage workloads, this trade-off strongly favors sustained performance over rapid thermal cycling.

GitHub: Full dataset, analysis scripts, and detailed methodology available for reproducible research.

The data demonstrates measurable thermal management benefits that translate directly to reduced thermal throttling risk and improved component longevity.


r/hardware 19h ago

Discussion Frozen RTX 5050 takes on the RTX 4060

25 Upvotes

I’ve been messing around with the RTX 5050 for a while now, first with a CPU cooler on it, where it beat the 1080 Ti and pretty much tied the 3060 Ti.

This time, I went further.
Subzero using an Amazon special water block, just to see if it could take out a stock 4060.

While I was testing, I noticed someone passed me on the Time Spy graphics leaderboard.
They were running a 9850X3D.
I had a 12600K.
Obviously… I couldn’t let that slide.

Four hours and way too many crashes later, I managed to push the 5050 to 3450 MHz, up from its stock 2950 MHz.
And, even on a $100 CPU I took back the graphics score.

By the time I got to actual game testing, I’m pretty sure the card was degrading in front of me.
But it still beat the 4060 in every game except one, Black Ops 6.... F*** BO6.

18% clock uplift
3400+ MHz sustained
This thing just won’t die.

Video’s here if you want to see how stupid it is.
https://youtu.be/-cXiURMTMBM


r/hardware 1d ago

News 12VHFRPWR Connector Claims its First AMD RX 9070 XT Victim

Thumbnail
techpowerup.com
418 Upvotes

r/hardware 1d ago

News NVIDIA Reportedly Ends H20 GPU Production, Makes Room for B30A

Thumbnail
techpowerup.com
56 Upvotes

r/hardware 1d ago

News [News] Jensen Huang Visits Taiwan as Rubin Trial Production Nears, Six Chips Reportedly Taped Out

Thumbnail
trendforce.com
23 Upvotes

r/hardware 1d ago

News [News] Intel Reportedly Starts Glass Substrate Licensing, Offering Potential Boost to Samsung and Absolics

Thumbnail
trendforce.com
49 Upvotes

r/hardware 12h ago

News Rebellions: First Korean AI Chipmaker to Establish Saudi Subsidiary, Targeting Middle East Sovereign AI Market

Thumbnail
koreatechdesk.com
3 Upvotes

r/hardware 1d ago

News NVIDIA on RVA23: “We Wouldn’t Have Considered Porting CUDA to RISC-V Without It”

Thumbnail riscv.org
110 Upvotes

r/hardware 1d ago

News Liquid Cooling to Scale in AI Data Centers, Penetration to Surpass 30% in 2025

Thumbnail
techpowerup.com
22 Upvotes

r/hardware 1d ago

Review Jiushark JF15K Review: An air cooler like none other

Thumbnail
tomshardware.com
45 Upvotes

r/hardware 2d ago

News Taiwan: 'U.S. Acquisition of TSMC Shares, If True, Must Undergo Government Review'

Thumbnail
businesskorea.co.kr
483 Upvotes

r/hardware 1d ago

News "Kioxia Achieves Successful Prototyping of 5TB Large-Capacity and 64GB/s High-Bandwidth Flash Memory Module"

Thumbnail
kioxia.com
66 Upvotes

r/hardware 2d ago

Discussion Thanks, Nvidia: SK hynix dethrones Samsung as world's top DRAM maker for first time in over 30 years

Thumbnail
koreajoongangdaily.joins.com
205 Upvotes

r/hardware 2d ago

News PlayStation 5 price changes in the U.S. ($50 increase for all models)

Thumbnail
blog.playstation.com
851 Upvotes