Nvidia RTX Spark

(nvidia.com)

174 points | by shenli3514 13 hours ago

35 comments

  • perarneng 2 minutes ago
    No thunderbolt is a big no for me. Its one of the greatest feature of MacbookPro that makes it dockable and expandable as a desktop with a good thunderbolt dock.
  • nerdjon 16 minutes ago
    Some competition for Apple in this space and competition for Intel and AMD is great.

    But I really do question how well Windows on Arm is really going to work out long term.

    For Apple it worked because they were able to force the issue. If you wanted a new Mac it was going to be Arm and we all knew eventually (this year or is it next year?) Intel support would drop. Over time we have seen M series exclusive features.

    Developers were forced to update or abandon Mac which gave users a great experience (with some early growing pains).

    This is something that Windows will never be able too do. They will always be stuck maintaining an emulator and a likely large subset of apps only supporting one over the other. (also does this work the other way around with an Arm only app working on x86?)

    This seems like a repeat of when it was not uncommon for games to only support Intel or AMD or NVIDIA or AMD. But worse since they are not both x86. Sure at least we have emulation but just like with Rosetta2 it shouldn't ever be the long term solution.

  • giancarlostoro 23 minutes ago
    Can it work with Linux? That's all I care about.
    • tarruda 8 minutes ago
      I don't think there's any incentive for Nvidia to make this a Windows-only device, so most likely it will be fully supported on Linux, just like their GPUs are.
    • dismalaf 7 minutes ago
      It's a collaboration with Microsoft so going to say no, probably not.
    • newsclues 11 minutes ago
      This is strangely absent from the news.
  • 2001zhaozhao 1 hour ago
    I think this is the first time an ARM windows device gets marketed for gaming. Would be interesting to see what kind of performance hit games have on the x86 to ARM translation layer.
    • fidotron 1 minute ago
      Rosetta on Mac was obviously impressive. There was also impressive Arm->Intel translation in the mobile ecosystem at one time.

      One reason it works surprisingly well on modern systems is how much is offloaded to the GPU. You aren't going to get great power optimization or anything without it being truly native though.

      There are games which are CPU limited though, and it will be interesting how those do. Curiously those also tend to be in engines with Arm support already.

    • lowbloodsugar 37 minutes ago
      Apple Silicon has a special mode that modified how the ARM chip handles memory transactions to be like x86. Does this nvidia ARM have the same?

      What would be interesting to me would be how quickly developers start targeting ARM64 directly.

      • odkurzacz 30 minutes ago
        For Apple use of Rosetta 2 was only temporary as they moved whole lineup to ARM. MS would not abandon x64 anytime soon. So I'm guessing they will try hard to convince developers to release for both architectures.
  • wolttam 7 minutes ago
  • seanalltogether 31 minutes ago
    "Unified Memory" still means divided address space right? You have to pre-allocate system vs gpu and copy from one to the other?
  • hs86 10 hours ago
    I am wary of those ARM-based Windows machines because I am unsure how good the ongoing driver support for those SoCs will be. Will they even outlive the Windows version they currently ship with?

    Looking at devices like the NVIDIA Shield gives me some hope that NVIDIA will be better than Qualcomm here. I just hope this is not a case where the OEM has to purchase X years of driver support from the chip vendor beforehand, and that NVIDIA will provide support directly itself.

  • babhishek21 32 minutes ago
    Question is: "Can it run Doom?"
  • PeterStuer 10 hours ago
    It's been almost 30 years, and a single letter changed. When will we get the Sparkstation, the UltraSpark and the SuperSpark?
  • timpera 12 hours ago
    We'll need to wait for the benchmarks, but this looks great! Windows 11 ARM64 is already amazing, and if these really are an upgrade from the Qualcomm chips we're going to have even better laptops on the market.
  • numron-dev 25 minutes ago
    Yeaaaah . But at what Cost though.
  • Tiberium 11 hours ago
    For anyone curious to know how this will fare against Macbooks, at least in CPU perf: DGX Spark has the exact same GPU and CPU as the top RTX Spark laptops will, so you can just directly compare from that.

    Of course, DGX Spark is a miniPC, so laptops will likely be slower due to power limits/throttling.

  • tonoto 5 hours ago
    What is this product anyway? Is it a general purpose CPU or is it specifically designed for MS Windows? Nvidia stepping back from the open source?

    "Introducing the NVIDIA RTX Spark™ Superchip. The fusion of NVIDIA AI and RTX graphics in a single chip redefines Windows PCs and delivers amazing creating, AI development, and gaming—on the slimmest, most beautiful RTX laptops ever and small, ultra-efficient desktops."

    • mingus88 3 hours ago
      It’s nivdia attempting to compete with Apple’s M-series
      • Almondioco 28 minutes ago
        Its nvidia attempt to gain additional market share and expected as well. If the whole ecosystem is around nvidia and its the easiest way of running stuff, Nvidia offering more enterprise infrastrcuture allows companies to just buy directly nvidia.

        Nvidia is also very very rich and pushes the boundaries of stuff. They stoped waiting for industry standards. You can see this in there network stuff. All nvidia.

        Next logical step (at least now, not something i thought about) was there CPU for their GPU racks/clusters/systems.

        Now they have everything anyway, RTX Spark is just logical.

        I don't think its specificly targeted at Apple at all.

        Apple has like 10-15% market share and just because some IT nerds buy themselves a mac mini doesn't mean much.

        Plenty of them actually just run openclaw without local models. Something which surprised me quite a lot.

        But i have two 4090 at home. They consume a lot of power and i had to research the proper Mainboardmodel and had to mod one 4090 to use water cooling because they run too hot.

        There Spark setup was at 3k, way to expensive for normal people. If they can get this down and sell more, great for their ecosystem (strengthening it) and getting more money from people.

        It does surprise me though that they have enough capacity for this chip and not just putting everyting in Rubin but perhaps the build out has slowed down a little or they start to diverse already for economic savety

      • giancarlostoro 22 minutes ago
        Also sounds like they are ditching the discrete GPU altogether.
      • dawnerd 46 minutes ago
        All the news articles in my feed mentioned Nvidia reinventing personal computing which is laughable given the specs are worse than the m series. I’m guessing they saw how well Apple devices were selling and rushed to get something similar out so they can ride the hype train and have something to fall back on if ai DC spend slows down.
        • AlotOfReading 26 minutes ago
          There's a lot of companies trying to support datacenter systems like GH and Rubin that don't have dev hardware remotely resembling it. M-series isn't a good option, speaking from the personal experience of currently using one for this exact purpose.
    • hasteg 1 hour ago
      I wouldn't say it's Nvidia stepping back from open source... if anything this is doubling down on it, as one of the selling points of this is the 128GB of unified memory which will allow for hosting local models (i.e, nvidia's new open model they just released). I guess it's pretty cool, I'm a big supporter of local LLMs/open weight models so seems enticing to me, although I'm not sure this will be super applicable to a lot of regular consumers. Seems like a pretty niche product.
  • minraws 11 hours ago
    Awesome, won't be buying it all at current prices but once they calm down, I will very much like to get one.

    Around 2-3K USD something with a good GPU + CPU + 128GB of integrated RAM is just going to be an awesome experience.

    Considering Mac options are north of 5K+ even on a regular day.

    • Tiberium 11 hours ago
      DGX Spark is $4700, so I kind of doubt that RTX Spark's top configs will be cheaper than that.
      • minraws 11 hours ago
        isn't dgx ai first and rtx prosumer first. I think it will be cheaper longer term not atm with component inflation
      • KeplerBoy 11 hours ago
        The DGX also contains the 200 GbE networking and linux support.
        • fmajid 10 hours ago
          The ConnectX 7 2x200 Gbps networking card in the DGX Spark alone is worth $700
          • KeplerBoy 9 hours ago
            To be fair the connectx-7 in the spark can't even push 2x200 Gbps since it is connected via 4 pcie lanes.
            • Ballas 6 hours ago
              Technically it's connected via 8 PCIe gen 5 lanes (two 4x connections), allowing ~100Gbps per port.
              • KeplerBoy 5 hours ago
                Thanks for the correction. I should have looked it up; I only remembered it being somewhat odd.
        • Tiberium 11 hours ago
          Laptops will also have to contain a much tighter configuration, display, keyboard, camera, etc ;)
          • minraws 10 hours ago
            there is desktop variant as well
  • boredatoms 13 hours ago
    Is this just dgx spark, but a laptop?
    • pella 13 hours ago
      yes, same chip

      + Windows

      + Screen

      - ConnectX-7 Smart NIC

      • throw0101c 13 minutes ago
        > - ConnectX-7 Smart NIC

        Can the link type be toggled between Ethernet and Infiniband? (Don't think I've ever heard of a laptop with IB.)

      • pedrocr 6 hours ago
        + battery too. I've wondered if a mini pc with battery would make for a good form factor. I often move between places where I have a desk with a screen but still use a laptop because I want to just suspend and resume. If a mini pc had a small battery just to hold its RAM while suspended I could move between places and just plug in a single USB-C cable and have my full workstation up and running. The thermals could be better than in a laptop and having a built-in UPS better than with a desktop. But last time I checked no one packaged things like that.
        • pbadams 1 hour ago
          There's the Khadas Mind series of mini pcs. They have a proprietary docking interface though. Agree that it would be great if this form-factor was more common.
      • zer0zzz 12 hours ago
        What about the desktop version? It seemed like it is not a dgx since it has the CPUs cores done by mediatek
        • Bulat_Ziganshin 6 hours ago
          They didn't say that Mediatek made the cpu sores. Grace is NVidia's own cpu arm cores. I bet that Mediatek made other parts of SoC necessary for a notebook
          • wtallis 48 minutes ago
            NVIDIA hasn't done custom CPU cores for anything they've yet branded "Grace". The original Grace data center CPU (paired with the Hopper data center GPU) used ARM Neoverse V2 cores. The "GB10" chip shipped in DGX Spark and announced here for RTX Spark uses Cortex X925 and Cortex A725 CPU cores.

            Physically, NVIDIA did the GPU chiplet and Mediatek did the other chiplet that has the CPU, DRAM controller, and IO.

        • cpgxiii 4 hours ago
          The DGX Spark/GB10 has CPU cores from Mediatek (in a pretty odd cluster configuration, too).
        • pipyakas 12 hours ago
          desktop is GB300, not GB10 like Spark
          • wtallis 46 minutes ago
            GB300 is nominally "available" in desktop form factor workstations priced around $100k. That's a few orders of magnitude away from the ordinary desktop PC market that consumers participate in.
          • KeplerBoy 11 hours ago
            they also announced a GB10/N1X windows desktop mini PC.
  • throw0101c 21 minutes ago
    Imagine a Beowulf cluster of these. /slashdot
  • rsolva 11 hours ago
    Will NVIDIA get a monopoly on providing laptops and desktops with a lot of RAM going forward?
  • jmyeet 51 minutes ago
    One of the thing that cripples these small AI PCs (including the DGX Spark) is memory bandwidth. The DGX Spark had 275GB/s. For comparison, a 5090 is ~1800GB/s. The M3 Ultra in the current high end Mac Studio is 819GB/s and there will likely be new M5 Mac Studios in Q3. Memory bandwidth is going to directly limit your tokens/sec.

    I didn't see this in the announcement linked but according to other reporting [1] the RTX Spark has memory bandwidth of 600GB/s.

    I got curious about some other cards. The 5070Ti is 896GB/s. The 5080 is 960GB/s. The RTX 6000 Pro is the same as the 5090, 1792GB/s. The 5090 has an MSRP of $2k but more realistically is $3-3.5k that you can buy it at. The RTX 6000 Pro is ~$10k (ignoring a cheaper, lower-power version). The 6000 Pro has slightly more CUDA cores but is otherwise mostly just a 5090 with more memory. It shows how much NVidia uses VRAM for market segmentation.

    It's also why the Apple chips are so interesting. A theoretical M5 Ultra in 2026 Mac Studios will possibly exceed 1000GB/s, putting it in rarefield air. Memory bandwidth isn't everything, obviously, but for this market we're largely talking about inference and memory bandwidth is going to limit performance.

    As another data point, the M3 Ultra has no FP4 mode and for FP16 it's 57 TFLOPS (vs 419 for the 5090) so there's still a gap here but I think Apple has a real market opportunity here if they decide to focus on it.

    [1]: https://wccftech.com/nvidia-enters-pc-space-with-rtx-spark/

  • nokeya 11 hours ago
    It was wintel (windows + intel) before. This will be what? Windia? Wintek?
  • jqbd 13 hours ago
    They made their own x86 CPU? Or was that part outsourced? Ok ARM MediaTek.
    • try-working 13 hours ago
      ARM cpu made by MediaTek.
      • zamadatix 12 hours ago
        But probably worth clarifying it's not a typical "MediaTek CPU" some might assume by that. It has Nvidia's customized ARM CPU implementation + their GPU.
        • TiredOfLife 2 hours ago
          This has off-the-shelf Arm cores.
      • Bulat_Ziganshin 6 hours ago
        I think that Nvidia made GPU and CPU, and Mediatek made other parts of SoC necessary for a notebook. Grace is Nvidia's own CPU ARM core
        • SomeHacker44 6 hours ago
          I believe Grace is an ARM designed core. Vera is the nVidia designed core.
  • lowbloodsugar 40 minutes ago
    ARM64+GPU sure seems like the future. I'm still using my M1 and even that can handle models well, has decent graphics, M5 is a beast, and M6 must surely go even bigger on LLM compute. Now Microsoft has a compelling ARM64+GPU future too.

    What does AMD or Intel have here?

  • mastermage 12 hours ago
    Is this finally Macbook Chip Efficiency coming to Windows or will it just be shittier compatibility for slightly better battery life?
    • zer0zzz 12 hours ago
      I heard leaked geekbench putting it behind the m3, which is couple years old now.

      All I care about is if I can get one of these for significantly less than a dgx and get Linux on it for some cuda Blackwell kerneling.

  • ma2kx 12 hours ago
    Unified RAM means its soldered to the mainboard, right?

    I'm not sure if I like this. Sure for a laptop this might be not a big problem but if this ARM ecosystem is a success it will spread to desktop computers and I fear we could lose the existing modularity.

    • phcreery 15 minutes ago
      I think unified RAM means soldered to the SoC, which is in turn soldered to the mainboard
    • Skinney 12 hours ago
      "Unified" means that it's shared between CPU and GPU, I believe.

      But yes, it tends to be soldered on.

    • Bulat_Ziganshin 6 hours ago
      No, but LPDDR means soldered, there are no LPDDR dimms
      • debugnik 4 hours ago
        There's LPCAMM2, but it's very recent. The Framework Pro laptop supports it, for example, although only on the Intel variant.
    • Rekindle8090 11 hours ago
      [dead]
  • t_mahmood 11 hours ago
    After nvidia's many years of neglecting Linux, paired with direct Microsoft's involvement? Are we going to trust them, to allow installing Linux in these easily?

    I don't think so.

    This most likely be a winmodem situation, again

    • TiredOfLife 5 hours ago
      DGX Spark has the same soc and ships with Ubuntu
      • t_mahmood 28 minutes ago
        Okay, but still it's highly skeptical trusting MS, and NVIDIA.
    • bigyabai 1 hour ago
      [dead]
  • donkeylazy456 12 hours ago
    hope nvidia support driver better than qualcomm. also hope they support linux soon.
  • throwa356262 12 hours ago
    I have no idea how powerful or power efficient these guys are, but this seems to be the first step in a bigger push towards Windows on ARM (without loosing gaming).

    I think more announcements will follow soon from other companies.

    • fmajid 10 hours ago
      My DGX Sparks are the first and only devices I have with 200W USB-C PD. Low power by AI workstation standards, but intolerable in a laptop.
      • MoonWalk 48 minutes ago
        Intolerable? Why?
        • ternaryoperator 38 minutes ago
          Battery life
          • MoonWalk 14 minutes ago
            The comment I'm replying to appears to be talking about power DELIVERY, not consumption. Why would extra power-delivery capacity be intolerable?
    • jauntywundrkind 11 hours ago
      It's worth noting that Nvidia power management on Linux has been absymal. There also aren't any of the usual power management options to see how much power things are using, which is quite atypical for a modern system.

      Nvidia really threw stuff over the wall with the DGX Spark release. They don't seem to really care. I sort of think they'll spend a little more time on Windows, where there's no pesky upstreaming to do and they can just do whatever, but man, it's such typical hubris from Nvidia to build such an expensive box with good chips but make it basically unsupportable and roasty hot all the time.

      You also generally have to run an ever more stale two year old Ubuntu derived DGX OS to get anywhere, with bespoke kernel and drivers all. None of it is well supported, none of it just works like a comparable PC or even well behaved arm system would.

      As for other ARM, there were rumors AMD Sound Wave is/was going to be a ~10W arm APU, but there hasn't been much said about it lately. Honestly given the ram crunch, it's maybe just not worth trying to build a system with a cheap core, if the rest of your costs are going to stay so stratospheric. https://www.techpowerup.com/341848/amd-sound-wave-arm-powere...

  • hgoel 12 hours ago
    Looks like the MSI one might be a 2-in-1, if it has good stylus support I might have a good candidate for an upgrade, thought my ~3-4 year old Galaxy Book is holding up alright for now.
  • zmk5 12 hours ago
    I really like this, but I think the reason Apple Silicon took off was that Apple sort of forced devs to support ARM. Not sure if Microsoft can do the same for Windows…
    • supersing 12 hours ago
      Developers weren’t really “forced” to support ARM. They simply recognized that all future Macs would be ARM, whereas most new PCs would continue to run on x86. So the incentive to adopt ARM was much weaker on the PC side.
    • trvz 12 hours ago
      They didn’t though. Rosetta 2.
      • ptole_my 2 hours ago
        rosetta is a relatively short term solution. will be supported up to macOS 28
    • aa-jv 11 hours ago
      Microsoft can do the same for windows - they need to address the fat bundle solution that Apple came up with, but for Windows, though ..
  • ChrisArchitect 6 hours ago
    Related:

    A powerful new chapter for Windows PCs, accelerated by Nvidia RTX Spark

    https://news.ycombinator.com/item?id=48352693

    Surface Laptop Ultra: Made for World Makers

    https://news.ycombinator.com/item?id=48352627

  • cyanydeez 12 hours ago
    competitor is already on the market and is x86: AMD AI 395+

    bechmarks with DGX arnt spectacular for NVIDIAs software and CUDA lead.

    wouldnt count on this being a price/compute challenger. especially with overpriced VRAM.

    • porphyra 12 hours ago
      Strix halo's 8060S gpu is very weak, and is roughly equivalent to a 4060 laptop GPU, whereas GB10's gpu is equivalent to a desktop 5070. For LLM throughput, tok/s is similar due to bottleneck by memory bandwidth, but the GB10 has 3x faster prefill. People have also been able to squeeze out much better performance on GB10 using NVFP4 and other improvements in the months after the DGX Spark launch, so don't be misled by early lackluster benchmarks. For the RTX Spark, which also targets gaming and creative applications, the 3x faster GPU is quite nice.
    • xyzzy123 12 hours ago
      Or like a m4 max? This thing has <300GB/s vs the max with 550GB/s

      All those CUDA cores in the sparks but they're starved for memory bandwidth.

      I am still waiting for NVidia to release a system that legit beats 3090 maxxing for the home gamer...

      • moondev 12 hours ago

          Spark:
          OS: Windows/Ubuntu
          Mbw: 300GB/s
          Cuda cores: 6000
          GPU accelerated containers: yes
        
        
          M5 max:
          OS: macOS
          Mbw: 600GB/s
          Cuda cores: 0
          GPU accelerated containers: no
        • xyzzy123 12 hours ago
          I feel like the shape of the market right now for "home lab" inference is:

          The sparks are good if your ultimate plan is to spend even more on NVidia hardware in future to run your dev setups at usable speeds. Or, you're developing for a work cluster.

          If you mainly want to run local models at acceptable speeds portably, buy a mac with lots of RAM. If you’re happy with non-portable / racked, buy 3090s (dense) or mac studios (MoEs). Buy newer cards if you are restricted on power or slots. If you are rich, buy a6000 blackwells.

    • zer0zzz 12 hours ago
      The only Question is is it worth suffering hip and x86? I suspect a lot of folks might like a machine that mimics their GB300 But costs less than a dgx.

      Also I heard the tensor core instructions on the dgx are gimped and you’re better off with a rtx pro x000. Is that the same with these machines?

    • SilverElfin 12 hours ago
      Is CUDA really a lead for long? Aren’t all the latest competitive approaches avoiding all the standard software stacks and writing deeply customized software that is very directly tied to whatever hardware they use?

      And is it really a way to lock in people? With AI coding tools, isn’t it trivial to write software on top of CUDA and rewrite it to target some other hardware?

  • SilverElfin 12 hours ago
    Some other relevant discussions and sources …

    NVIDIA and Microsoft Reinvent Windows PCs for the Age of Personal AI

    https://news.ycombinator.com/item?id=48352705

    NVIDIA DGX Station for Windows Puts a Trillion-Parameter AI Supercomputer on Every Enterprise Desk

    https://news.ycombinator.com/item?id=48352691

    Introducing Surface Laptop Ultra: Made for world makers

    https://news.ycombinator.com/item?id=48352627

    Introducing a powerful new chapter for Windows PCs, accelerated by NVIDIA RTX Spark

    https://news.ycombinator.com/item?id=48352693

    • EugeneOZ 12 hours ago
      2 comments in total there
  • SilverElfin 13 hours ago
    It all sounds good on paper. But I have trouble believing Windows can be a good platform for this. Microsoft has lost all trust after inserting ads into windows, slowly removing power user features, and exploiting every dark pattern they can. And for years, the ARM based Windows laptops have been useless due to app compatibility issues. Why would this change now? Is it priced to be a lot cheaper than Apple’s laptops? Or is this a niche product for AI developers basically?
    • TreeInBuxton 2 hours ago
      A lot of the app compatibility issues on current machines are down to Qualcomm's poor drivers - the actual core bits are mostly okay.
    • __atx__ 12 hours ago
      The "gaming" take is a strange one indeed for an ARM platform. Hopefully they (Microsoft or Nvidia?) put some real effort into the translation layer. They claim modern AAA games, but it is possible they strongarmed the developers to make them an ARM build for a few select titles...
      • satvikpendem 11 hours ago
        It's clear gaming was not a major concern, it's just "good enough" for someone running AI models and occasionally wants to play some games, not made to primarily play games.
      • SilverElfin 12 hours ago
        Yep. I noticed the press releases talk about all the partners they have. It seems like a desperate attempt to manufacture a consensus to invest in this new hardware instead of leaving it sort of abandoned like the other Windows ARM stuff. But the problem is that these attempts end up having a few very visible apps working on the architecture and others not actually doing anything substantial.

        Sure the graphics capabilities are probably very good. But if you’re a game developer who has traditionally built on Windows on x86 chips, would you want to invest in this new chip or invest in making games for the Apple ecosystem? Aren’t there more new customers to reach in the Apple world than this new Nvidia world?

        • andsoitis 12 hours ago
          > But if you’re a game developer who has traditionally built on Windows on x86 chips, would you want to invest in this new chip or invest in making games for the Apple ecosystem?

          Windows and the new chip. Higher developer productivity and higher chances of a substantial audience.

    • satvikpendem 11 hours ago
      Who cares about Windows, the goal is to run local AI models similar to AMD Strix Halo and Apple Silicon machines. The OS is honestly a distant last concern as long as the models work well, as you could put Linux on these too, but not sure how well wake lock works.
    • bentcorner 11 hours ago
      Anecdotally Windows ARM works fine for me, although to be honest most of my work is command line + browser anyway. WSL works like a treat. Steam installs and most lower end games also play fine on my ARM laptop too. Games that require kernel anticheat don't work.

      I think they make a great "second device" where you have something meatier to fall back to if something doesn't quite work right. I'm not sure if it's ready to take on the "main device" role just yet. But it's a far far better experience than the Surface RT days.

    • try-working 13 hours ago
      Hopefully MSFT would look at this as a do or die system, and go all in on improving the user and ownership experience. Will they? Not so sure.
      • Gigachad 11 hours ago
        Microsoft sees windows purely as a platform to sell AI products these days.
      • jfim 12 hours ago
        That's what they're working on, in theory, with Windows K2.
        • fhn 12 hours ago
          I would never trust Microsoft. Their next drama is revoking Office 2019 perpetual licenses https://www.youtube.com/watch?v=KRnno9VIZx0. It never ends with them because they know they have you by the balls.
          • twilo 11 hours ago
            I trust them on a daily basis. No issues thus far..
    • sorry_outta_gas 13 hours ago
      [dead]
  • pseudosavant 11 hours ago
    This may finally be the chip family ARM on Windows has always needed. Qualcomm's chips have always been dogs with slow off-the-shelf ARM CPU cores that have pathetic single-threaded performance compared to x86 AMD/Intel or ARM Apple Silicon designs.
    • kcb 53 minutes ago
      This will likely have worse single threaded performance than recent Qualcomm CPUs.
    • BoggleOhYeah 1 hour ago
      These chips also appear to be using off-the-shelf ARM cores.
    • TiredOfLife 5 hours ago
      Qualcomm Snapdragon x1 and upcoming x2 use their Oryon core and have much faster single-thread performance than Intel/Amd and this nvidia soc that uses off-the-shelf arm cores
      • pseudosavant 1 hour ago
        That wasn't true of the X1, but apparently the X2 (which is only in a single device so far) does appear to finally be fast. The first Windows ARM CPU to be faster than any of its x86 rivals. Competitive with Apple Silicon single-thread performance even.

        I was disappointed to see that the RTX Spark has the ARM cores from the DGX Spark. I was hoping it had their new in-house developed cores that Nvidia is starting to use on their latest gen server parts. They look really fast. That said, if RTX Spark has CPU performance like the DGX Spark, it will be almost as fast as the top AMD/Intel parts.

  • renoir 12 hours ago
    So basically Cerebras style?
    • KeplerBoy 12 hours ago
      Not at all. This is a more like what Apple has been doing the past few years. A bunch of decent arm cores paired with a beefy integrated GPU.
    • trvz 12 hours ago
      No.
  • officerk 11 hours ago
    This will crush the M5 Max going by the numbers. I'm curious to see how much they end up costing
    • Tiberium 11 hours ago
      It won't, the top tier RTX Spark has the same exact CPU and GPU as DGX Spark, so you can check DGX Spark CPU benchmarks to see how it fares. Spoiler: it's about M3 Max level. And they're only coming this fall.
    • aenis 11 hours ago
      Nah, still ~300GB/s memory bandwidth. That will be slower than the M5 max, by a wide margin for LLM inference.
    • Rekindle8090 11 hours ago
      M5 max is 3x stronger and 50% more power efficient. nice try though.
      • spwa4 8 hours ago
        ... but you'll be rewriting inference for any model that isn't a well-known LLM. Yourself.
        • wbolt 1 hour ago
          AI coding agents can do that pretty nicely already and it will only (slowly) improve over time.