This seems predicated on there being significant workloads that split gpu's betw...

vlovich123 · 2025-07-16T00:54:27 1752627267

Many of the GPU rental companies charge less for shared GPU workloads. So it's a cost/compute tradeoff. It's usually not about the workload itself needing the full GPU unless you really need all the RAM on a single instance.

privatelypublic · 2025-07-16T02:52:29 1752634349

Any examples to check out? The only one I know of is Vast.ai... and there's already a list of security issues a mile long there.

diggan · 2025-07-16T12:34:26 1752669266

I don't think Vast.ai does "shared GPUs"; you can only rent full rigs. At least, there is no indication that the hardware is shared between multiple users at the same time.

But I think services like Runpod let you rent "1/6 of a GPU per hour", for example, which would basically be "shared hosting", as there would be multiple users using the same hardware at the same time.

huntaub · 2025-07-16T03:50:38 1752637838

My (limited) understanding was that the industry previously knew that it was unsafe to share GPUs between tenants, which is why the major cloud providers only sell dedicated GPUs.

bluedino · 2025-07-16T01:06:23 1752627983

NVIDIA GPUs can run in MIG (Multi-Instance GPU) mode, allowing you to pack on more jobs than you have GPUs. Very common in HPC, but I don't know about in the cloud.

privatelypublic · 2025-07-16T02:42:41 1752633761

I thought about splitting the GPU between workloads, as well as terminal server/virtualized desktop situations.

I'd expect all code to be strongly controlled in the former and reasonably secured in the latter, given that software/driver-level mitigations are possible and that corrupting somebody else's desktop with Rowhammer doesn't seem like a good investment.

As another person mentioned (and maybe it is in wider use than I thought), cloud GPU compute running custom code seems to be the only useful target. But I'm having a hard time coming up with a useful scenario. Maybe corrupting a SIEM's analysis and alerting during an ongoing attack?

cyberax · 2025-07-16T06:52:44 1752648764

No large cloud hoster (AWS, Google, Azure) shares GPUs between tenants.

shakna · 2025-07-16T21:25:56 1752701156

Is that not what AWS is offering here? [0]

"In multi-tenant environments where the goal is to ensure strict isolation."

[0] https://aws.amazon.com/blogs/containers/gpu-sharing-on-amazo...

cyberax · 2025-07-16T23:55:02 1752710102

This is for customers. AWS can use virtualization to slice their GPUs across multiple workloads (in their K8s), but AWS itself doesn't share GPUs.

privatelypublic · 2025-07-16T03:02:33 1752634953

Update: I thought for a second I had one: Jupyter notebook services with GPUs. But looking at Google Colab*, even there it's a dedicated GPU for that session.

* Random aside: how is it legal for Colab compute credits to have a 90-day expiration? I thought California outlawed expiring company currency (à la gift cards).

dogma1138 · 2025-07-16T08:27:34 1752654454

Colab credits likely aren’t a currency equivalent but a service equivalent, which is still legal to expire AFAIK.

Basically, Google Colab credits are like buying a seasonal bus pass with X trips or a monthly parking pass with X hours, rather than getting store cash that can be used for anything.

SnowflakeOnIce · 2025-07-16T02:50:32 1752634232

Example: A workstation or consumer GPU used both for rendering the desktop and running some GPGPU thing (like a deep neural network)

privatelypublic · 2025-07-16T02:54:58 1752634498

Not an issue; that's a single tenant.

Which is my point.

spockz · 2025-07-16T04:19:54 1752639594

Until the GPU is accessible by the browser and any website can execute code on it. Or the attack can come from a different piece of software on your machine.

haiku2077 · 2025-07-16T00:58:31 1752627511

GKE can share a single GPU between multiple containers in a partitioned or timeshared scheme: https://cloud.google.com/kubernetes-engine/docs/concepts/tim...

privatelypublic · 2025-07-16T02:48:17 1752634097

That's the thing... they're all the same tenant. A GKE node is a VM instance, and GCE doesn't have shared GPUs that I can see.

im3w1l · 2025-07-16T00:50:18 1752627018

WebGPU API taking a screenshot of the full desktop, maybe?

Buttons840 · 2025-07-16T01:48:40 1752630520

Do you think WebGPU would be any more of an attack vector than WebGL?

privatelypublic · 2025-07-16T02:33:26 1752633206

Rowhammer itself is a write-only attack vector. It can, however, potentially be chained to change the write address to an incorrect region. Haven't dived into details.

SnowflakeOnIce · 2025-07-16T02:48:33 1752634113

How is it a write-only attack vector?

privatelypublic · 2025-07-16T03:14:32 1752635672

Rowhammer allows you to corrupt/alter memory physically adjacent to memory you have access to. It doesn't let you read the memory you're attacking.

There are PoCs of corrupting memory _that the kernel uses to decide what that process can access_, but the process can't read that memory. It only knows that the kernel says yes where it used to say no (assuming it doesn't crash the whole machine first).
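
To make the "can corrupt but can't read" point concrete, here is a toy sketch in Python. It is not a real exploit and does not model real DRAM: the `ToyDram` class, the weak-cell set, the access threshold, and the row layout are all made-up assumptions for illustration. The attacker only ever touches its own row and learns about the flip solely through the kernel's yes/no answer.

```python
class ToyDram:
    """Toy DRAM model: heavy access to one row flips 'weak' bits in its neighbours."""

    def __init__(self, num_rows, row_bits, weak_cells, threshold):
        self.rows = [[0] * row_bits for _ in range(num_rows)]
        self.weak_cells = set(weak_cells)  # hypothetical (row, bit) pairs that can flip
        self.threshold = threshold         # made-up number of accesses needed for a flip
        self.accesses = [0] * num_rows

    def hammer(self, row, count):
        """Access `row` `count` times; past the threshold, disturb adjacent rows."""
        self.accesses[row] += count
        if self.accesses[row] < self.threshold:
            return
        self.accesses[row] = 0
        for victim in (row - 1, row + 1):
            if 0 <= victim < len(self.rows):
                for bit in range(len(self.rows[victim])):
                    if (victim, bit) in self.weak_cells:
                        self.rows[victim][bit] ^= 1  # bit flips; attacker never sees it


# Assumed layout: the kernel's permission bit lives in the row next to the attacker's.
KERNEL_ROW, ATTACKER_ROW, ALLOW_BIT = 1, 2, 3


def kernel_allows(dram):
    """The only feedback the attacker gets: does the kernel say yes or no?"""
    return dram.rows[KERNEL_ROW][ALLOW_BIT] == 1


def attacker_read(dram, row):
    """The attacker can read its own row only."""
    if row != ATTACKER_ROW:
        raise PermissionError("attacker cannot read this row")
    return list(dram.rows[row])


dram = ToyDram(num_rows=3, row_bits=8,
               weak_cells={(KERNEL_ROW, ALLOW_BIT)}, threshold=1000)
before = kernel_allows(dram)      # kernel says no
dram.hammer(ATTACKER_ROW, 1000)   # attacker touches only its own row
after = kernel_allows(dram)       # kernel now says yes
print(before, after)
```

In this sketch `attacker_read(dram, KERNEL_ROW)` still raises `PermissionError` after the flip, which is the sense in which the attack is blind.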

SnowflakeOnIce · 2025-07-16T11:36:58 1752665818

Suppose you have access to certain memory. If you repeatedly read from that memory, can't you still corrupt/alter the physically adjacent memory you don't have access to? Does it really need to be a write operation you repeatedly perform?

extraduder_ire · 2025-07-16T21:57:55 1752703075

> Does it really need to be a write operation you repeatedly perform?

Yes. The core of rowhammer attacks is in changing the values in RAM repeatedly, creating a magnetic field, which induces a change in the state of nearby cells of memory. Reading memory doesn't do that as far as I know.

privatelypublic · 2025-07-16T14:32:28 1752676348

I probably should have called it "blind" instead.