Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Boris, you're seeing a ton of anecdotes here and Claude has done something that has affected a bunch of their most fervent users.

Jeff Bezos famously said that if the anecdotes are contradicting the metrics, then the metrics are measuring the wrong things. I suggest you take the anecdotes here seriously and figure out where/why the metrics are wrong.



On the subject of metrics, better user-facing metrics to understand and debug usage patterns would be a great addition. I'd love an easier way to understand the ave cost incurred by a specific skill, for example. (If I'm missing something obvious, let me know.)

Baking deeper analytics into CC would be helpful... similar to ccusage perhaps: https://github.com/ryoppippi/ccusage


This is useful if you want to keep an eye on what claude's actually doing behind the scenes: https://github.com/simple10/agents-observe


But the default 1M context window just rolled out a few weeks ago. If refreshing old sessions on 1M context windows is the problem, it's completely aligned with what Boris is saying.


We are taking it seriously, and are continuing to investigate. We are not trusting the metrics.


The quantitative ux research team at Google was created for exactly this problem: a service which became popular before the right metrics existed, meaning metrics need to be derived first, then optimized. We would observe users (irl), read their logs, then generate experiments to improve the behavior as measured by logs, and return to see if the experiment improves irl experiences. There were not many of us and we are around :)


I worked with Boris in the past and in my experience, Boris cares deeply about the customer. I'd vouch that Boris really cares about the issue people are running into.


But no other user has yet come and said "I worked with ajma in the past ..." so how can we trust your judgement about Boris?


I saw this guy named Claude saying ajma is a genius!



Nice try boris


[flagged]


Anthropic can't win in this case.

They don't use Claude Code, they get accused that they don't even trust it themselves.

They use Claude Code, they get accused the code is shit because it's slop.

I think dogfooding is known to be a legitimate approach here.


The idea is that Claude Code is surprisingly buggy and unrefined for something created by the very tool and processes that are supposed to be replacing us as we speak.


The idea is that sculpted ideal code is rarely the best choice.


At the same time I'd say sloppy code (human or AI generated) is rarely the best choice. I'd say the best is in between.


I don’t see (nor do I care about) the code and how sculpted it is. I perceive the tool as buggy.


And they don't use our version of CC, or with our settings. They have flags for internal use only.


> Anthropic can't win in this case.

Sure they can. The solution is pretty simple and in your own post. Choose either:

* Make the product good to the point code is no longer slop and shit.

* Stop hyping the quality when it isn’t there.

* Do a hybrid approach. Use their own product but actually have competent humans in the loop to make the code good.

This is not hard. Be honest and humble and that criticism goes away. It’s no one’s fault but Anthropic’s that they hype up their product to more than it can do and use it carelessly to build itself. It’s not a no-win scenario if you’re the one causing your own obviously avoidable problems.


Google products ux is widely acknowledged to be a steaming pile of shit though, so I am not sure you should follow their example.

Many of the metrics they use are obviously actively user hostile.


Metrics and quantitative ux results in really bad software, making it rigid while optimizing for the wrong things.

The most obvious example is Google creating multiple steps for Login where you have to enter your password after you put in your user.

I wonder what metric lead to that decision or was it a political decision to make it seem like their "old" software has some new feature.


If you mean Google website login, that step is needed because the email address is used to determine which identity provider to use. E.g. I have three different accounts that branch off from that same initial login flow.

One is my person "gmail.com" account, and the other two go through enteprise identity providers related to my employment and their G-Suite licenses. So after I put in one of these three email addresses, I get prompted for the appropriate next step. Only one of them involves giving a password to a Google server. The other two are redirects to completely separate login systems operated by my employer.


I mean I get it logically makes sense. But it still seems like a waste of time for a small percentage of use cases.

Maybe a better approach is put in your login have it automatically detect if it requires an identity provider. Gray out the password to signal to the user password is not necessary and automatically redirect.

Less clicking, don't break flow and think of a smoother solution.


Thank you


[flagged]


HN sometimes talks about pathological customers who will never be happy. Boris is probably the single best rep in the community, possibly ever.

The way your tone and complaints come across reminds me of this. As a paying customer ($5k spend per month in my corporate job), I’d rather anthropic keep doing what they’re doing — innovating and shipping useful stuff at blinding speed — and not index on your feedback. I think the tradeoffs they would cost far outweigh the consequences.


> Boris is probably the single best rep in the community, possibly ever.

When you say “the community”, what exactly are you referring to?


Dang man, chill.


Man, expecting the minimal from companies who are supposed to deliver a pro... there is no SLA for any this, so you are right.

Also, why is there no SLA?


You’re not getting a worthwhile sla on a subscription at this rate. What are you going to get? A few dollars? An sla isn’t useful unless it actually bites for the provider and actually compensates the customer. And it costs money - how much are you willing to spend for this insurance?


because there isn't one and people still paid for it.

My clients demand one, so there is one.


Imagine if people were like your clients.


If they were, they wouldn't buy your product without an SLA. But they're not.


Because this is ultimately a beta service. The whole industry is.


Wait, where is there a 'beta' tag to something that they are charging real money for? Why is this software any different than any other software and we should completely give away our rights as a consumer to ensure what we pay for is delivered?


I think the parent is saying that one should be aware that the whole LLM industry is still in an experimental stage and far from mature. What you want isn’t what’s being offered. I agree that there should be higher standards, but what we currently have is an arms race. The consequence is to factor that into the value proposition and maybe not rely too much on it.


SLAs should be standard for any paid service, especially on the enterprise side, but also on the consumer side. Being immature as a company does not excuse a lack of service delivery.


Not every customer, even a paying customer, demands reliability at a particular level. Market segmentation tends to address those situations: pay more, get more.


> pay more, get more

Users on $200 plan complaining, already at max level of subscription, I don't think a $200 subscription should make you feel like you are getting unfair advantage. Like restricting claude -p to API ... after I paid so much? Moderate use should not do that. I am not running it batch mode on a million inputs.


'I don't want to hold companies to account for failing to deliver services, therefore I think everyone else should live by my permissive "standards".'


They can be held to account when they fail to deliver what they promise! But what is promised for delivery is what's in the Terms of Service (i.e. the agreement). Nothing more. If it's not in there, you can't hold them to account for it.


Yes, that's the problem.

It's too easy for companies to fail to provide their service as long as they never promise to provide their service.


> It's too easy for companies to fail to provide their service as long as they never promise to provide their service.

I don't even know what this means. You can't make anyone work for free, nor dictate the terms of what kind of work someone will do without their consent. I assume you are not pro-slavery.


I'll make a very simple example.

The service at mcdonald's is providing food for money.

When their ice cream machine is broken, they fail to provide part of their service.

I'm not saying anything about "making" them do anything. I'm just calling out their failure and saying it's a bad thing.


You didn't merely call out their failure. You said it was "too easy," implying something more, like they owe you something. It's a pretty entitled point of view.


I don't think it's "entitled" to want companies to put some effort into avoiding those failures.

If the government did something, we could think of it as similar to passing inspection.

The other way to look at things is that the market isn't varied and competitive enough to punish the companies that fail this way.

They don't have to "owe me" anything for me to desire a different balance. My desire is fine.


"[W]ant[ing] companies to put some effort into avoiding ... failures" is not the same as "hold[ing] them to account". The former is "this sucks and I don't like it." The latter is "punish them or force them to do what I want!"--i.e., some sort of legal remedy.


If you can point to a consumer targeted service that provides and keeps their SLAs, I’ll be impressed.


What right as a consumer do you have that is pertinent here, other than to have the vendor adhere to the terms of the agreement you have with them?

Anthropic has many customers despite the fact that they have occasional problems. They’re not suing Anthropic because Anthropic isn’t promising in its agreement something they can’t deliver.

I think you’re reading into the agreement something that isn’t there, and that’s the cause of your confusion.


I am not reading into an agreement, I am saying there is no agreement to be found to ensure service delivery and the associated liability that would come for any SLA. Also, where is the Anthorpic SLA for Enterprise?

Does it exist?

Just because people pay for things doesn't mean they know or understand what they are paying for. Nor is there the legal precedence to actually understand where the rub lies or how that impacts business.


> Just because people pay for things doesn't mean they know or understand what they are paying for.

I believe, respectfully, that’s precisely what is happening in this thread because you keep complaining about the absence of an SLA that was never in the agreement, as though it is—or is supposed to be—there, and therefore the existence of some “rights” that would flow from that.


There are no SLAs, in any agreement, thats the problem.



It's incredible that Boris is here on HN being open and sharing an issue they don't fully understand yet, and offering a possible workaround. CTFO.

Thank you Boris.


I am sorry you feel this way, but the reality of the situation is there is zero reason to trust anything Anthropic or Boris says. They have no legal liability or obligation to tell the truth, besides brand risk, which to people like you is mitigated for a single person to show up, post, and thats it.


You should work at these companies and understand they have good intentioned employees otherwise they’d rarely pass the cultural interviews plus background checks plus backchanneling. Have a bit more faith in the employees


> Have a bit more faith in the employees

Have you been asleep for a decade?


lol it is _way_ too easy for people to talk like this behind a computer screen.


What truth do you believe you are not being told, exactly?


Dude is on hacker news on a Sunday. half the GDP of the world is competing with him. What metrics would you like to see?


An enforceable SLA with the services that Anthropic offers rather than putting an employee to respond to things on Sunday.


>> rather than putting an employee to respond to things on Sunday.

Maybe just maybe they didn’t put him here, rather he just a normal guy who reads HN, who is passionate about his role, and is here on his own time.


Maybe... maybe... maybe... none of this builds trust when there is something that does build trust; putting revenue on the line and opening yourself to legal liability. Otherwise everything is empty and meaningless, its just PR, and nothing more.


You can get a SLA and ZDR by choosing one of the Claude partners (eg on Bedrock)

https://platform.claude.com/docs/en/build-with-claude/api-an...


Then you should offer to pay them for one. I’m sure they’d love to hear from you, and they could probably deliver one to you for the right price. But it will be a high price.


They don't offer a ZDR [0] for files, even if you have a BAA or dealing with HIPAA data, no matter how much you pay them. Trust me, we have tried.

[0] https://code.claude.com/docs/en/zero-data-retention


I’m really confused. We were talking about SLAs, not other product features. Are you moving the goalposts?


There isn't an SLA nor is there any protections around file uploads to their services. Two, bad, things can be true at the same time.


Did you talk to them about purchasing an SLA? If so, what did they say?


I feel like you aren't really understanding what a Service-level Agreement actually is in practice. It's not a piece of paper with a specific number of nines and an associated price tag. They can be and often are very complicated documents that take multiple rounds of redlining to arrive at something both parties agree to.

If zero data-retention was non-negotiable for the customer, it's totally possible that the negotiations ended there.

I'm not sure what you're trying to accomplish or unearth beyond what's already been said, which certainly suffices for me.


As both an attorney and SRE, I understand what an SLA is. And you can absolutely get an SLA when you buy cloud services from many vendors, including AWS. Some vendors provide it at all price points; others include it at higher service tiers, without complex negotiations needed at all. And, yes, if it’s not on the menu, you may need to negotiate one. But you can’t conclusively say “they don’t offer one” unless you’ve actually gone to the company and asked.

https://aws.amazon.com/legal/service-level-agreements/

https://trailhead.salesforce.com/content/learn/modules/slack...

https://support.atlassian.com/subscriptions-and-billing/docs...

Before you casually accuse someone of not knowing what they’re talking about, first make sure you’re on firm ground yourself.


It seems like you could save a lot of time and confusion by talking about the SLA that you pay for from Anthropic instead of establishing your bona fides by posting links to various unrelated companies’ SLA pages.

Like how was your experience negotiating your SLA with Anthropic? What ballpark are you paying for the SLA with Anthropic that you have in place? How many 9s does your Anthropic SLA cover? Obviously you haven’t posted a half dozen times in this thread about how Anthropic by nature of existing offers SLAs without any knowledge of that, so some simple stuff about your SLA with Anthropic would be helpful.


I make no unqualified claims as to whether Anthropic offers an SLA. I never did. But I do know that it's unreasonable to claim they don't when you didn't even take the steps to conclusively determine it for yourself.

As I said: "I’m sure they’d love to hear from you, and they could probably deliver one to you for the right price. But it will be a high price."


Oh, well in that case, if posting URLs counts as proof of… something, there doesn’t appear to be any SLA page anywhere in their sitemap. https://www.anthropic.com/sitemap.xml

Maybe it is just common for enterprise SaaS businesses to offer SLAs without having a page about it though. Something like that could possibly be unjustifiably burdensome as well because it’s not like they could just type “make a page about how we offer SLAs” and have it magically appear


Not everything a business might be willing to do is listed on their public website.


That’s a good point. Having an SLA page is an indicator that a business offers SLAs, not having an SLA page is also an indicator that they offer SLAs, just secretly. If you think about it all of the people constantly complaining about uptime and saying stuff like “I would pay money for an SLA from Anthropic if I could” probably means that they are killing it with all those secret SLAs.

I mean obviously they have to offer them, because they exist, as otherwise you’d have to believe something crazy like “they don’t currently offer them” for reasons “that they haven’t disclosed”


Again, many companies will do things they don’t ordinarily offer for the right price. I’ve seen it happen myself (on both the buyer and seller side) on many occasions.

It goes to the extent of the company itself! Very few businesses publicize that they’re for sale or put their company’s purchase price on their website. But acquisitions happen all the time.

Anyway, I don’t appreciate your sarcasm coupled with what seems to be willful ignorance about how the world works, so I won’t be participating in this discussion with you anymore.


I don’t get it. If you wanted to convince everybody about a vast universe of secret business and your expertise in it, why would you start with telling people that weren’t able to get an SLA from Anthropic that Anthropic offers SLAs? And then admit that you don’t actually know and then double down?

Like if I wanted to convince people that In’N’Out has a secret menu (they do) I wouldn’t start by saying “They have the ingredients to make onion rings, therefore they sell onion rings” (they do not). They offer burgers with lettuce instead of a bun (“protein style”) though. That’s a fact that you can verify by going there or calling them and asking about it. I didn’t rely on my assumptions based on other fast food restaurants, I relied on my knowledge of the topic!

Edit: It seems like bad faith to admit that you’re using “probably” interchangeably with “I don’t know” and then editing in “for a billion dollars” several posts into a conversation.

I guess enjoy posting about entirely unrelated conversations in other threads though. (otterley’s post about my having previously had a short amicable exchange with dang in a different thread was deleted, but I’ll leave this part up. I think digging through people’s post histories to find unrelated grievances is icky, for lack of a better word, and wildly unhelpful for any type of discussion)

Even with the “for a billion dollars” addition, admitting “I don’t know” and “probably” are interchangeable doesn’t really change anything from a logical standpoint. Nobody argued against you not knowing, so I don’t understand the purpose of the repetition.


> why would you start with telling people that weren’t able to get an SLA

That hasn’t been established. There’s no evidence that they went to Anthropic and tried to negotiate one.

> that Anthropic offers SLAs

I didn’t. I said “they probably will for the right price.” There are two modifiers in that statement. And the price is unspecified. Their first offer could be a billion dollars. Too expensive? Negotiate down.


I would invite you to notice your interlocutor's assumptions, especially as revealed in his prior comment. Look at how he misunderstands the situation:

> If you wanted to convince everybody about a vast universe of secret business and your expertise in it...

> Like if I wanted to convince people that In’N’Out has a secret menu...

You are discussing business. He is understanding you to be attempting to "mog" him, because he cannot adopt a perspective wherein the conversation represents anything other than a vacuous social challenge or "brodown."

In short, you're wasting your time.


I am so old :(

I looked up “mogging” and I’d think “my assumptions about stuff are valid because I’m a lawyer and don’t know what you do” would count more as mogging than “that doesn’t quite sound right, this is a conversation about something specific and not your general cleverness” but I’ve got a Benny Hill archive to get through


Boring corporate Ai will surely come, but hey, lets enjoy the wild west while it lasts. I am grateful to see Boris come here to address problems people face. I 100% sure nobody is making him - he has one of the coolest jobs in the world.


>he has one of the coolest jobs in the world.

So that means we just eject any critical thinking when it comes to companies, especially where they is no liability or obligation for them (Boris or Anthropic) to be honest.

Other than 'trust'.


Don’t like Anthropic? Use a competing service. At this point the sheer volume of your commentary is not particularly complimentary to your own critical thinking skills. It’s not your job to correct the internet or to convince randoms of the rightness of your position. Of all the things in the world to be pissed at so insistently, this seems to be a pretty minor one.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: