Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

We are going to drop blackblaze over this

We discovered this change recently because my dad was looking for a file that Dropbox accidentally overwrote which at first we said “no problem. This is why we pay for backblaze”

We had learned that this policy had changed a few months ago, and we were never notified. File was unrecoverable

If anyone at backblaze is reading this, I pay for your product so I can install you on my parents machine and never worry about it again. You decided saving on cloud storage was worth breaking this promise. Bad bad call



I'm going to drop Backblaze for my entire company over this.

I need it to capture local data, even though that local data is getting synced to Google Drive. Where we sync our data really has nothing to do with Backblaze backing up the endpoint. We don't wholly trust sync, that's why we have backup.

On my personal Mac I have iCloud Drive syncing my desktop, and a while back iCloud ate a file I was working on. Backblaze had it captured, thankfully. But if they are going to exclude iCloud Drive synced folders, and sounds like that is their intention, Backblaze is useless to me.


Bidirectional auto file sync is a fundamentally broken pattern and I'm tired of pretending it's not. It's just complete chaos with wrong files constantly getting overridden on both ends.

I have no clue why people still use it and I'd cut my losses if I were you, either backup to the cloud or pull from it, not both at the same time like an absolute tictac.


> I have no clue why people still use it

This is an instance of someone familiar with complex file access patterns not understanding the normal use case for these services.

The people using these bidirectional sync services want last writer wins behavior. The mild and moderately technical people I work with all get it and work with it. They know how to use the UI to look for old versions if someone accidentally overwrites their file.

Your characterization as complete chaos with constant problems does not mesh with the reality of the countless low-tech teams I've seen use Dropbox type services since they were launched.


This would be half OK if it worked, but you can't trust it to. OneDrive, for instance, has an open bug for years now where it will randomly revert some of your files to a revision from several months earlier. You can detect and recover this from the history, but only if you know that it happened and where, which you usually won't because it happens silently. I only noticed because it happened to an append-only text file I use daily.


A specific implementation (OneDrive) doing something dumb doesn't invalidate the entire paradigm though. Things work just fine elsewhere (Dropbox, Google Drive, Nextcloud, and Seafile are all solutions I've had good experiences with).


Agreed, I’ve been using Dropbox for 15 years with minimal issues. The key is to ensure it’s running and syncing with the proper settings on both machines.

What can get things into a weird state is if both machines are editing the same file while only one of them is actively syncing. But for basic backup and sync, this is extremely rare.


Even crazier is one drive has a limit on the total length of a file path, how is this even a thing that exists.


Unlimited strings are a problem. People will use it as storage.

No, I'm not joking. We used to allow arbitrary paths in a cloud API I owned. Within about a month someone had figured out that the cost to store a single byte file was effectively zero, and they could encode arbitrary files into the paths of those things. It wasn't too long before there was a library to do it on Github. We had to put limits on it because otherwise people would store their data in the path, not the file.


I remember someone telling me that S3 used to be similarly abused - people were creating empty files and using S3 like a key-value store somehow, so AWS just jacked up the price of S3 head-object API call to push people back to DynamoDB or whatever.


Just include filename size in file size for billing purposes?


Not sufficient, unfortunately. The strings for file paths are stored in wholly different infrastructure with wholly different optimizations. It probably lives in your database. You really don't want people just stuffing gigabytes into that, payment or no payment. Odds are you didn't plan your control plane around, "what if someone uses our strings as encoded data?"


They won't do it if it's not free


In the fine print, only to be used against bad actors (w/guarantee that filenames under x chars would never be charged), or that too problematic? building good faith into policy + "hiding" info...

Reason - to not overcomplicate or give appearance of nickel-and-diming


No, just charge for the amount of storage they use on your server. Not the amount of data you think you’re storing. In non-special cases these will be the same number.


Makes sense.

Would there be any engineering/management pushback on the customer side? "we have to write a tiny script", "this is non-standard" / "why are you the only ones who charge us for filenames?"

(have limited knowledge here)


Wow alright I have learnt something thank you


What do you expect to happen when your cloud storage file path is 5000 characters long and your local filesystem only supports a maximum of 4096?


You expect the files to still be accessible using relative paths. What do you expect to happen if your cloud storage file path is 50 characters long and is mounted in a folder which is 4050 characters long when PATH_MAX is 4096?

The sync application itself can handle this using openat(2) or similar and should probably be using that regardless to avoid races.


Ah, I forgot that the maximum path length is usually limited by PATH_MAX, it's the path segment that's usually limited by the filesystem.

Point taken, although I still think it's better for cloud storage services to err on the side of compatibility, i.e. what's the lowest common denominator between Linux, macOS, Android, iOS from 10 years ago and Windows 7?


Oh yeah... I remember Windows behaving weirdly when I tried to copy some files with long names into a deeper directory tree. And it was just weird behaviour - no useful error message.


Windows in particular supports at the API level paths tens of thousands of characters long, much longer than Linux. The problem is applications need to explicitly support such paths using the long path syntax, otherwise they're limited to 255 characters.


Yeah I thought there was some way of doing it, but weirdly it was explorer.exe that was behaving in odd ways.


Great point I stand corrected


Everything needs limits otherwise someone will figure out how to or accidentally break it.


I stand corrected you’re right


Except the GNU stuff, which has as a design principle "no arbitrary limits". Meaning no limits at all, not "no sane limits":

  Avoid arbitrary limits on the length or number of any data structure, including filenames, lines, files, and symbols, by allocating all data structures dynamically.
I assume they're relying on the OOM Killer and quotas to prevent DoSes all over the place.


I also have no clue why people use it.

You can build such a system yourself quite trivially by getting an FTP account, mounting it locally with curlftpfs, and then using SVN or CVS on the mounted filesystem. From Windows or Mac, this FTP account could be accessed through built-in software.


This reference is 19 years old this month, in case anyone who recognized it was still feeling young.


noooooooooooo!!!!!!!!


Wait a moment, you just gave me an idea for a product


1 out of a thousand people might do that, the others will buy the product. That's why people use it, most people don't want to build everything themselves.



But as usual it forgets the "For a Linux user" part.

If we remove the whole linux section and just ask "why not map a folder in Explorer" it's a reasonable question, probably even more reasonable in 2026 than in 2007. The network got faster and more reliable, and the dropbox access got slower.


LOL I should have remembered. :D Sorry!


This cannot be a serious proposal. You should probably talk to people who don't use technology because they love it, but because they need it.



It’s kind of wild to read through these comments and realize hn is still riffing on the same ideas. Is it e2ee? Does it run on Linux? Who would pay for something you can slap together in a weekend with a few bash scripts? Really highlights this community’s values, skills, and blind spots. Also a bit of a bummer that the privacy and open source situations today are even worse in many ways.


The equivalent of this is advice from a friend of mine who likes different teas, "just learn to read hanzi like I did and then you can select the ones you like". Apparently there's one called "government tea" (in the original) which he expected to taste of old leather, musty paperwork, and stale cigarette smoke.


or use rsync.net and a cronjob.


Obvious. Explorer even has support built in for transparent ‘native’ gui support. I’m not even sure why you felt the need to explain it in detail. Next you’ll be explaining how to walk. (/s, I loved it)


Slow as fuck compared to 2 synced dirs


I think this is a case of people using bidirectional file sync wrong. The point is to make the most up to date version of a file available across multiple devices, not to act as a backup or for collaboration between multiple users.

It works perfectly fine as long as you keep how it works in mind, and probably most importantly don't have multiple users working directly on the same file at once.

I've been using these systems for over a decade at this point and never had a problem. And if I ever do have one, my real backup solution has me covered.


+1. It works perfectly if your mental model is:

“Every file is only ever written to from a single client, and will be asynchronously made available to all other clients, and after some period of time has elapsed you can safely switch to always writing to the file from a different client”.


Bidirectional file sync is also in hot demand from people who don't know the words, "file", "client", "write", "async", "available", or "time"

:P


The fact that lay people can and will use a tool incorrectly does not mean said tool is not useful


Yep. See e.g. steam cloud saves, which is literally just Dropbox for your video game save files. Bidi sync is a super common pattern if you look for it, I'm surprised at all the hate it's getting here.


> And if I ever do have one, my real backup solution has me covered.

What do you use and how do you test / reconcile to make sure it’s not missing files? I find OneDrive extremely hard to deal with because the backup systems don’t seem to be 100% reliable.

I think there are a lot of solutions these days that error on the side of claiming success.


I agree. I use syncthing for syncing phones and laptops. For data like photos, which aren't really updated. It works very nice. And for documents updated by one user, moving between devices is totally seamless.

That being said i understand how it works at a high level.


It works perfectly fine if you're user that know how it works. I use it with Syncthing and it works coz I know to not edit same file at the same time on 2 devices (my third and fourth device is always on internet so chances propagate reasonably fast even if the 2 devices aren't on on the same time)

But the moment that hits normal users, yeah, mess


Hello, Jim from Backblaze here. I wanted to offer some insight into what happened with backing up cloud-synced folders.

It is true that we recently updated how Backblaze Computer Backup handles cloud-synced folders. This decision was driven by a consistent set of technical issues we were seeing at scale, most of them driven by updates created by third-party sync tools, including unreliable backups and incomplete restores when backing up files managed by third-party sync providers.

To give a bit more context on the “why”: these cloud storage providers now rely heavily on OS-level frameworks to manage sync state. On Windows, for example, files are often represented as reparse points via the Cloud Files API. While they can appear local, they are still system-managed placeholders, which makes it difficult to reliably back them up as standard on-disk files.

Moreover, we built our product in a way to not backup reparse points for two reasons:

1. We wanted the backup client to be light on the system and only back up needed user-generated files. 2. We wanted the service to be unlimited, so following reparse points would lead to us backing up tons of data in the cloud

We’ve made targeted investments where we can, for example, adding support for iCloud Drive by working within Apple’s model and supporting Google Drive, but extending that same level of support to third-party providers like Dropbox or OneDrive is more complex and not included in the current version.

We are currently exploring building an add-on that either follows reparse points or backs up the tagged data in another way.

We also hear you clearly on the communication gap. Both the sync providers and Backblaze should have been more proactive in notifying customers about a change with this level of impact. Please don't hesitate to reach out to me or our support team directly if you have any questions. https://help.backblaze.com/hc/en-us/requests/new

We are here to help.


Throw some clock skew into the mix and it’s even more hilarious!


Why is this downvoted?


The insult to tictacs.


Same. Specifically I was considering Backblaze for our company’s backups (both products, computers and their bucket for server backups. That is no longer the case as of the news.


Totally fair. If you end up taking another look, happy to help.


Natasha from Backblaze here. Just wanted to let you know that we do backup iCloud data, as long as the files are stored locally on your device (not just in iCloud as “optimize storage” / cloud-only files).

If iCloud is set to keep full copies on disk, Backblaze will treat those like normal files and back them up.

More details here: https://www.backblaze.com/computer-backup/docs/en/back-up-ic...

Happy to answer any questions if anything’s unclear.


At the top of this page:

>NOTE: >iCloud's most recent update prevents Backblaze from backing up files that iCloud synced. >To back up these files, download them to another local location where Backblaze can read them.

So which is it?


It's baffling that they did this little PR blitz across HN and Reddit and then didn't reply to any of the questions.


It’s not baffling. It’s time to get off of backblaze.


This is very well put, and echoes my sentiments! I had installed Backblaze on my own home machine many, many years ago, and it has saved my bacon a few times. Since then I've also installed it on any family members' machines that required backup and recommended it to friends. And I've been happy to pay for the service.

The deal was that Backblaze backs things up and I don't have to worry about it. Learning that it does not back things up is a punch to the gut. I am familiar with the exclusions and I have a look at that list to make I'm not missing anything from my backups. I had always thought the exclusions list was exhaustive.

Excluding other files and folders without telling me about it breaks the deal. Dropbox is important to several of the users I installed it for. Ignoring .git folders is another one that affects me and I had not known about. Ouch.

I will now have to look for alternatives. It has to be easy to install, run seamlessly on non-technical users' machines and be reliable.

I find it hard to be think of a worse breach of trust for a backup service than not to back up files!


Most people have recommended Arq to me. I will try it next time I visit my parents https://www.arqbackup.com/


Shsh... It's an excellent reliable no frills service. People might flood it.


Hello, Jim from Backblaze here. I wanted to offer some insight into what happened with backing up cloud-synced folders.

It is true that we recently updated how Backblaze Computer Backup handles cloud-synced folders. This decision was driven by a consistent set of technical issues we were seeing at scale, most of them driven by updates created by third-party sync tools, including unreliable backups and incomplete restores when backing up files managed by third-party sync providers.

To give a bit more context on the “why”: these cloud storage providers now rely heavily on OS-level frameworks to manage sync state. On Windows, for example, files are often represented as reparse points via the Cloud Files API. While they can appear local, they are still system-managed placeholders, which makes it difficult to reliably back them up as standard on-disk files.

Moreover, we built our product in a way to not backup reparse points for two reasons:

1. We wanted the backup client to be light on the system and only back up needed user-generated files. 2. We wanted the service to be unlimited, so following reparse points would lead to us backing up tons of data in the cloud

We’ve made targeted investments where we can, for example, adding support for iCloud Drive by working within Apple’s model and supporting Google Drive, but extending that same level of support to third-party providers like Dropbox or OneDrive is more complex and not included in the current version.

We are currently exploring building an add-on that either follows reparse points or backs up the tagged data in another way.

We also hear you clearly on the communication gap. Both the sync providers and Backblaze should have been more proactive in notifying customers about a change with this level of impact. Please don't hesitate to reach out to me or our support team directly if you have any questions. https://help.backblaze.com/hc/en-us/requests/new

We are here to help.


As others have said, this is something that should have been communicated very clearly. The reason for using Backblaze is to have my data backed up, and not to worry about it. You say so yourself on your website.

Could you also provide an exhaustive list of items that are NOT being backed up, e.g. the .git folder? I can't find any reference to that anywhere on your website or in the app. What else is not being backed up? I know about the exclusion list in the app, which I have adjusted to suit my requirements, but you need to be clear, explicit and upfront about what you are not backing up. This is critical information.


You should tell your support teams that, because when I asked them if local Google Drive files were excluded they affirmed, and directed me to instructions on how to dissolve my business account.

I'm not surprised that support was wrong, but I was somewhat surprised there was zero attempt at customer retention.


I'm missing the WHY this wasn't communicated.

Isn't it obvious you need to let your users know there is a new constraint they don't expect?

Seems incredibly short-sighted to me.. The trust you build for years gets destroyed in seconds..


Natasha from Backblaze here. Fair criticism. This should have been communicated more directly.

For context, this was driven by changes on the Dropbox side in how synced files are handled, which affects how reliably they can be backed up. But even with that, it should have been surfaced much more clearly.


I'm going to join the exodus, though for a different reason. Switched to Orbstack and ever since Backblaze refuses to back up saying "disk full" as Orbstack uses a 8TB sparse disk image. You can exclude it, but if they won't (very easily) fix a known issue by measuring file sizes properly I don't feel confident about the product.


I've also been bitten by the Backblaze/Orbstack combo, and was one of the early communicators with the OrbStack team about it.

While I agree Backblaze is overdue to exclude the sparse image, in fairness to them, no other online backup solution (other than Time Machine itself) handled it correctly either, at the time I was investigating this last year.

(I'm not even sure Apple itself handles it correctly in all cases. I had to migrate to a new macbook recently, and Migration Assistant hung while transferring my files. I deleted the sparse image, tried again, and then it worked. Possibly a coincidence, I admit, but Migration Assistant reliably disappoints me every few years.)


Also taking recommendations for a simple services I can install on my dads windows machine and my moms Mac that will just automatically backup the main drive to the cloud just in case


I've been extremely happy with Arq https://www.arqbackup.com/ for several years as a quiet backup solution, bring your own storage. I've done a few small restores and it's been just fine, and it automatically thins your backups to constrain storage costs.

Managing exclusions is something to keep vaguely on top of (I've accidentally had a few VM disk images get backed up when I don't need/want them) but the default exclusions are all very reasonable.


Second vote for Arq, still on the trial but I have zero issues.

I'm using B2 as the backend, ironically, along with a Hetzner Storage Box. It just runs in the background, has decent defaults for "Don't backup useless crap" etc.

I'm still debating whether to get the single purchase version or pay $60 a year for 5 computers + 1TB of cloud storage.


I like https://www.borgbase.com

It's set it and forget.

You will need to set it up for them, then you get an email (from borgbackup, not the client so it works when the client is not running) when a backup hasn't happened for a while.

As client there are more options now (like Vorta, from them), but I have had success with https://github.com/garethgeorge/backrest and the Restic backend.


Installed Carbonite on my parents’ computer something like 15 years ago, and it still works (every now and then my dad tells me he used it to recover from a bug or a mistake).

But I have no idea where the company currently sits on the spectrum from good actor to fully enshittified.


`rclone` with AWS credentials. Go make a bucket and a key that can read/write to it.

Set up your config to exclude common non-file dirs, or say "only `/Applications` and `Home` and that's about it. If it's a file then it's a file, and it will be synced up.


my parents also lost data that was supposed to be backed up on backblaze because they didn't use that computer for a month and when they turned it back on the hard drive was dead. apparently backblaze silently deletes backups more than 30 days old even if they are the newest backup and then happily keeps billing you for not storing your data.


Natasha from Backblaze here. This is a situation people rely on backups for, and I can understand how frustrating it is to run into this when you need a restore.

I also want to be clear, this wasn’t about saving on storage. It came from cases where backing up cloud-synced folders (like Dropbox) was leading to unreliable or incomplete restores because of how those files are managed under the hood.

When Dropbox began using reparse points for synced files, those files no longer behaved like standard local files. Because of that, Backblaze Computer Backup can’t reliably back them up or restore them. The current behavior is focused on ensuring we only back up data we can reliably restore, and we are actively exploring ways to better support Dropbox and data touched by other sync services.


If I can make a product suggestion, Backboard has a part of the interface that shows excluded folders. This should explicitly call out the Dropbox folder as being excluded. The software update that removed support for this should’ve included a pop-up that said “Dropbox is no longer being backed up” (though candidly if this pop-up existed on my dad’s machine and he didn’t see it, my mistake).

My frustration stems from paying hundreds of dollars over several years to pay for backup and then silently learning Dropbox was no longer supported when we went to look for it in our backup. We could’ve made other choices about how to store/bavkup our own files with better communication.


Why not just use backblaze as cold storage and use restic or another tool with a GUI to backup to it?


Well this wasn’t the promise backblaze made a decade ago when we started using their products.

Now I need a new solution that will work for my parents


Of course, I wouldn’t use their client anymore. Actually, I would have never used it from the start as it’s not open source. I think for backups there’s no better guarantee than that. I don’t mean because you could look at the source code, I mean because in my experience open source products tend to care more about their users than not. At least for such foundational tools.


Backblaze Computer Backup != B2 Cloud Storage

You can't connect to their Computer Backup service through third-party software.


When an org quietly degrades one of their products, you should expect this behavior to occur again.


That is a lot more expensive if you have more than a small amount of data.


Hello, Jim from Backblaze here. I wanted to offer some insight into what happened with backing up cloud-synced folders.

It is true that we recently updated how Backblaze Computer Backup handles cloud-synced folders. This decision was driven by a consistent set of technical issues we were seeing at scale, most of them driven by updates created by third-party sync tools, including unreliable backups and incomplete restores when backing up files managed by third-party sync providers.

To give a bit more context on the “why”: these cloud storage providers now rely heavily on OS-level frameworks to manage sync state. On Windows, for example, files are often represented as reparse points via the Cloud Files API. While they can appear local, they are still system-managed placeholders, which makes it difficult to reliably back them up as standard on-disk files.

Moreover, we built our product in a way to not backup reparse points for two reasons:

1. We wanted the backup client to be light on the system and only back up needed user-generated files. 2. We wanted the service to be unlimited, so following reparse points would lead to us backing up tons of data in the cloud

We’ve made targeted investments where we can, for example, adding support for iCloud Drive by working within Apple’s model and supporting Google Drive, but extending that same level of support to third-party providers like Dropbox or OneDrive is more complex and not included in the current version.

We are currently exploring building an add-on that either follows reparse points or backs up the tagged data in another way.

We also hear you clearly on the communication gap. Both the sync providers and Backblaze should have been more proactive in notifying customers about a change with this level of impact. Please don't hesitate to reach out to me or our support team directly if you have any questions. https://help.backblaze.com/hc/en-us/requests/new

We are here to help.


>We’ve made targeted investments where we can, for example, adding support for iCloud Drive by working within Apple’s model

Is icloud drive backed up or not? Please be clear. Which versions support this?

https://www.backblaze.com/computer-backup/docs/back-up-iclou... still says that it no longer backs up icloud drive files. Then it gives directions on how to back them up. Which is it?

Also, why did you stop backing up .git folders, sliently?


Hey Jim,

Appreciate the thoughtful response. I recognize the challenge that cloud-synced folders introduce into the file storage ecosystem and the challenges with online/offline files + storage loopholes that could take a cause an engineering challenge.

That being said, we’ve been nearly 6+ years backblaze users and we probably can’t rely on backblaze if it can’t support these tools that are now pretty standard services to have installed. As I mentioned above, the promise for us was “pay for backblazd and never worry about whether our files are backed up”. We’ll be looking for an announcement if you can bring back Dropbox support.

Julian


Dropbox itself should already keep version history of files for 30 days with the free plan, or more if you pay.


I have some feedback for Dropbox as well for losing the file, but we don’t pay for that so my expectations aren’t the same.

My dad had a file untouched in Dropbox for 2 years. He overwrote it 2 days prior to me trying to recover it from Dropbox/Backblaze. They said he couldn't access the version that was just overwritten because that was over 30 days old, which is not what the definition of 30-day history is....


You fail to understand the problem, I suspect.

Just because files are in bespoke folders, does NOT mean they are being backed up.

Example: I'm 1,016% over my OneDrive limit because I canceled my Microsoft 365 account due to their price hike to cover for AI costs. My laptop still pushes files there upon save thanks to Microsoft defaults (my desktop was moved to CachyOS long ago).

If I had been using Backblaze for backup, those files would not have been backed up.

Luckily, I'm a nerd and I'm way ahead of this (I moved away from OneDrive long ago and never deleted the files). Most folks aren't.

Backblaze should be alerting users when stuff isn't backed up. I've strongly considered their B2 offering for a big project. The fact that they changed this without proper notification has made me decide NOT to move forward.


I thought you said Dropbox overwrote a file, meaning that file was also being synced by Dropbox, and that Dropbox sync was working. So likely the older file version would also have been synced by Dropbox, which it then overwrote. Dropbox itself keeps old versions of files for 30 days. I think you're saying in this situation Dropbox wasn't syncing though?

My comment was pretty orthogonal to all the Backblaze stuff, which I realize now was confusing.


I think they’re saying that Dropbox didn’t have the original (old) copy of the file retained, even though the change was just 2 days ago, because the old version was ‘more than 30 days ago’. Which is bonkers.


I don't think that's how the 30 day timer works. Once a file is replaced by a new one, the old copy should persist for 30 days. So if it was overwrote 2 days ago, should have 28 days to recover it. But don't know about this situation


That it didn’t do that, despite that assumption, is what the op was complaining about.


This is the same reason I always had Backblaze -- stick it on all my computer-illiterate relatives' PCs and then I know there is a fool-proof backup when something invariably goes wrong.


What are you going to use instead?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: