
From my perspective it’s just _confusing_ to work in AI right now. We have some massive models that are doing some really neat stuff, and apparently hundreds of millions of people are using them—but I keep wondering: to do _what_, exactly? I’m not asking what the models can do, I’m asking what people want the models to do every day, all the time.

I’ve been shown some neat pictures people made that they thought were cool. I don’t know that I need this every day.

I’ve seen examples of “write an email to my boss”. It would take me longer to explain to ChatGPT what I want than to write these myself.

I’ve seen “write a snippet of code” demos. But I hardly care about this compared to designing a good API; or designing software that is testable, extensible, maintainable, and follows reasonable design principles.

In fact, no one in my extended sphere of friends and family has asked me anything about ChatGPT, Midjourney, or any of these other models. The only people I hear about these models from are other tech people.

I can see that these models are significantly better than anything before, but I can’t yet see the “killer app”. (For comparison, I don’t remember anyone in my orbit predicting search or social networking as killer apps for the internet—but we all expected things like TV and retail sales to move online.)

What am I missing?



You’re asking “what’s so big about GUIs? Literally nobody has asked to move a pointer around a screen”.

It’s the use cases these things enable that are important.

Today, I wrote a draft product announcement. Only after I was done did I realize I had written it in a really impersonal third person (“users will be able to”). No big deal, but maybe 10-20 minutes of work to make it energetic and second person (“now you can…”).

Thirty seconds with ChatGPT: “Rewrite with more energy, in the second person, using best practices for announcements.”

Six months ago I would never have asked for that. Today it was glorious and let me move on to focus on more important things.


I mean Google revolutionized search. Apple revolutionized personal computing.

OpenAI revolutionized… rewriting things with slightly different wording?

I’ve seen so many breathless people posting “this would have taken me so long to search”, and then I type 3 keywords from the massive prompt they crafted and find it instantly on Google. We’re talking 1000x or more faster. I feel like the same thing is happening in your comment. How often in my life have I thought “damn, I wish I’d written this blog post ever so slightly differently”? Maybe a handful of times? And yes, I’m including all generalizations of that question.

But certainly fake girlfriends and summarization will be mid-size fields. Image generation has some mid-size potential. But these will be spread between many companies.

I really do think it has uses, no doubt, but is it a revolution? Where? It’s not creative in the valuable sense: media, art, fashion, etc. will all adopt it marginally, but ultimately it will only sharpen the appetite for the genuine human experience and cohesive creativity that it really falls flat at. It saves some marginal time, perhaps, if you’re OK with sounding like a robot.

Taking into account the downsides it looks like a hype bubble right now to me, and a draw in the long run. There’s just a whole lot of tech people trying to cash in on the hype.


You can program GPT in English.

Let me repeat that: You can program GPT in English. ENGLISH!

You're complaining about the first nuclear test bomb being impractical and uninteresting. How will this change the world? That huge monstrosity had to be affixed to the top of a test gantry, and it took years of effort by a veritable army of the best and brightest to make! No way it could change war, or geopolitics, or anything. No way.

This is the day after Trinity. The bomb has gone off. A lot of physicists are very excited, some are terrified, and the military is salivating. The politicians are confused and scared, and the general public doesn't even know yet.

That doesn't mean the world hasn't changed, forever.


> You can program GPT in English.

> Let me repeat that: You can program GPT in English. ENGLISH!

How?

Let me repeat that: How?

I had a little script that, from time to time, parses a list of jobs from a specific board, extracts some categories, and inserts them into a SQLite database, and I have a frontend that displays them to me the way I want.

The board has since changed some things which would mean maybe 2 hours of commitment from me to update the script.

How do I program GPT in English. ENGLISH! To do that for me? What are the steps involved? I've been using ChatGPT and GPT-4 for a while, and I can't imagine what the steps are to make this happen without a lot of back and forth. I can't imagine how to program the infrastructure. I can't imagine how the API endpoint is more than a fancy autocomplete. I need help understanding what it means that I can program it in ENGLISH! (I can also program it in my country's language, for what it's worth.)
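For concreteness, the kind of script described above might look like the sketch below. Everything here is illustrative: the board's real HTML shape, the category attribute, and the table layout are unknown, so they are all made up.

```python
import re
import sqlite3

# Hypothetical markup: the real board's HTML would differ, and a real script
# would fetch it over HTTP rather than use a hard-coded string.
JOB_RE = re.compile(r'<li class="job" data-cat="(?P<cat>[^"]+)">(?P<title>[^<]+)</li>')

def parse_jobs(html):
    """Return (category, title) pairs found in the board's HTML."""
    return [(m.group("cat"), m.group("title")) for m in JOB_RE.finditer(html)]

def store_jobs(db, jobs):
    """Create the jobs table if needed and insert the parsed rows."""
    db.execute("CREATE TABLE IF NOT EXISTS jobs (category TEXT, title TEXT)")
    db.executemany("INSERT INTO jobs VALUES (?, ?)", jobs)
    db.commit()

html = '<ul><li class="job" data-cat="backend">Python dev</li></ul>'
db = sqlite3.connect(":memory:")  # the real script would use a file on disk
store_jobs(db, parse_jobs(html))
```

When the board changes its markup, the change usually lands in `JOB_RE` and little else, which is exactly the "maybe 2 hours" of maintenance being described.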

> That doesn't mean the world hasn't changed, forever.

I sort of agree with this.


> make this happen without a lot of back and forth

Perhaps this is the part you're missing. When I've watched people program with ChatGPT it _is_ a lot of back and forth, because an enormous amount of context can be stored and referenced back. I.e., one wouldn't say "make me a Flappy Bird clone for iOS"; they'd start with:

"Give me the code for a starter SpriteKit project". Then

"Now draw a sprite from bird.png and place it in the center of the screen".

"Now make it so the bird sprite will fall as if it's affected by gravity"

I won't bore anyone with how one might go from that all the way to a simple game, but I'm sure you see the idea. There are obviously _huge_ limitations to this approach, and professionals will hit them fast, but the proof is in the pudding: people who can barely code are producing real software through this approach. It's happening.
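That incremental back-and-forth maps directly onto how the chat APIs are driven: each new instruction is appended to a running message list, and the whole history is sent every time, which is how the model keeps context. A minimal sketch of the loop, with `call_model` as a stand-in for a real chat API call (e.g. OpenAI's); it just echoes, since the shape of the loop is the point:

```python
def call_model(messages):
    """Placeholder: a real version would POST `messages` to a chat endpoint."""
    return f"[reply to: {messages[-1]['content']}]"

def chat_session(instructions):
    """Send each instruction together with the full prior conversation."""
    messages = [{"role": "system", "content": "You are a coding assistant."}]
    for instruction in instructions:
        messages.append({"role": "user", "content": instruction})
        reply = call_model(messages)  # the model sees everything said so far
        messages.append({"role": "assistant", "content": reply})
    return messages

steps = [
    "Give me the code for a starter SpriteKit project",
    "Now draw a sprite from bird.png and place it in the center of the screen",
    "Now make it so the bird sprite will fall as if it's affected by gravity",
]
history = chat_session(steps)
```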


> Perhaps this is the part you're missing. When I've watched people program with ChatGPT it _is_ a lot of back and forth, because an enormous amount of context can be stored and referenced back.

I've tried to build a lot of fun stuff with it so far. I haven't been able to properly 'program it in English' for anything non-trivial; the back and forth ended up in loops of not what I wanted. I'm just utterly confused at the difference between my experiences with it and what some people are preaching.

> There are obviously _huge_ limitations to this approach and professionals will get hit them fast, but the proof is in the pudding: people who can barely code are producing real software through this approach. It's happening.

I've had 4 product people I know try to create products using ChatGPT. All 4 of them basically got stuck on the first steps of whatever they were trying to do. "Where do I have to put this code?", "How do I put it online?", "How do I store user data?", "Where do I get a database from?". Basic questions to any professional, but to them it was impossible to overcome the obstacles from code to deployment.

I don't doubt that it's happening and it will become better in the future; I'm just having a hard time trying to grasp where some people are coming from when my experience as a professional, using it, has been mixed.


I've observed this schism between people who can get LLMs to produce useful output and people who are baffled. I think it's a mixture of two things:

Expectations: using the LLM to break problems into steps, suggest alternatives, and help you think through the problem. I think this is the people using it to write emails - myself included: having a loop to dial in the letter lets me write it without the activation energy needed to stare at a blank page.

Empathy: people who've spent enough time interacting with an LLM get to know how to boss it around. I think some people are able to put themselves in the LLM's shoes and imagine how to steer the attention into a particular semantic subspace where the model has enough context to say something useful.

GPT4 writes boilerplate Python and JavaScript servers for me in one shot because I ask for precisely what I want and tell it what tools to use. I think because I have dialed in my expectations for what it's capable of, and learned how to ask in precise language, I get to be productive with GPT4's code output. Here's a transcript: https://poe.com/lookaroundyou/1512927999932108


Interesting point about empathy. Sorry I'm abusing the comment system to get back to your comment in the future.


Let me give you a simple example. I had to deal with a desynced subtitle file recently. I described the exact nature of the desync (in terms like "at point X1 the offset is Y1, and at X2 it is Y2") to GPT-4 and asked it to write me a Python script to fix this. It did require a couple tweaks to run, but when it did, it "just worked".
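The generated script isn't shown in the comment, so this is only a plausible reconstruction of its shape: with the desync described by two reference points (at time x1 the offset is y1 seconds, at x2 it is y2), shift every SRT timestamp by the linearly interpolated offset.

```python
import re

TS = re.compile(r"(\d{2}):(\d{2}):(\d{2}),(\d{3})")  # SRT "HH:MM:SS,mmm"

def to_seconds(h, m, s, ms):
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000

def to_timestamp(t):
    ms = round(t * 1000)
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def resync(srt_text, x1, y1, x2, y2):
    """Shift each timestamp by the offset interpolated between (x1, y1) and (x2, y2)."""
    def fix(match):
        t = to_seconds(*match.groups())
        offset = y1 + (y2 - y1) * (t - x1) / (x2 - x1)
        return to_timestamp(t + offset)
    return TS.sub(fix, srt_text)
```

A drift like this (subtitles progressively lagging more over the file) is exactly what a constant shift cannot fix, which is why the two-point description matters.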


"Automatic Language-Agnostic Subtitle Synchronization"

Link: https://github.com/kaegi/alass

It's basically magic.


Honestly, I don't think it will be long before GPT can read this comment, politely ask you for the URLs of the job board and your git repo, and 2 seconds later you will have a pull request to review.


You might find this interesting - https://github.com/Torantulino/Auto-GPT

> Auto-GPT is an experimental open-source application showcasing the capabilities of the GPT-4 language model. This program, driven by GPT-4, autonomously develops and manages businesses to increase net worth. As one of the first examples of GPT-4 running fully autonomously, Auto-GPT pushes the boundaries of what is possible with AI.


Perhaps, but I'm talking about right now. I would love to be able to do this as I have 1000 ideas and no time to try them out.


I have some scrapers built in Scrapy, and from my experimentation with GPT4, I bet you could paste in your scraper code and the HTML source from the website in question (at least the relevant part), tell GPT4 to update your scraper, and you'd get something at least 95% correct within 30 seconds.


The same way people used to write code in the early days: trial and error, and a boatload of cursing!


You can't program GPT in anything if you can't program.

If your prompt is garbage then the output will be garbage and if you don't know how to program you won't even realize the output was garbage.

It's not the language part of programming language that is hard. It's the programming part, because it means you have to have a good understanding of what you want. Just like a human programmer won't read your mind, an AI programmer won't read your mind either.

But I can already foresee bosses dismissing employees that raise issues (performance, maintainability, scalability, etc., etc.) by saying "Look, the AI can do it. So if it can do it you can do it too.". I foresee this because I have already seen it.


> You can't program GPT in anything if you can't program.

That's what makes this so interesting: this type of automation impacts our jobs directly. Of course, I'm not sure who would use this in a corporate codebase without legal concerns.


I work at a financial place; we have a contract with OpenAI so they don't hoover up our user input. The normal URL is blocked.


> Let me repeat that: You can program GPT in English. ENGLISH!

The very existence of "prompt engineering", numerous discussions about how to prompt ChatGPT in order to get the result you want, etc. imply that while it may be in English, it still requires similar care and attention to do properly as a programming language does.

Which makes me wonder what the advantage of using English is. A formal language seems like it would be more productive and accurate.


For one, GPT-4 requires far less prompt engineering and generally interprets intent better.

The advantage of using English (natural language, that is) is that the humans around you tend to speak it. I don't naturally speak PowerShell. Instead, I want a script that searches for particular filenames, under a particular size, between particular dates, in a directory path I specify. I told GPT I wanted that, and in a few seconds it dumped out what I needed. It wrote the script in a formal language, which is then interpreted by the machine in an even more formal manner. Let the code deal with accuracy, and let's let language models argue back and forth with humans about intent.
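The generated PowerShell isn't shown, so here is the same request sketched in Python instead: files under a root whose names match a pattern, smaller than a size cap, last modified inside a date window. All parameter names are illustrative.

```python
import fnmatch
import os

def find_files(root, pattern, max_size, newer_than, older_than):
    """Yield paths under `root` matching `pattern`, smaller than `max_size`
    bytes, and modified between `newer_than` and `older_than` (epoch seconds)."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not fnmatch.fnmatch(name, pattern):
                continue
            path = os.path.join(dirpath, name)
            info = os.stat(path)
            if info.st_size < max_size and newer_than < info.st_mtime < older_than:
                yield path
```

The point of the comment holds either way: you describe the filter in English, and the model picks the formal language and gets the details right.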


> The advantage of using English (natural language, that is) is that the humans around you tend to speak it.

This is true, but of limited utility. English is so bad at this sort of thing that even native-speaking humans are constantly misunderstanding each other. Especially when it comes to describing things and giving instructions.

That's why we have more formal languages (even ignoring programming languages) for when we need to speak with precision.


That's the other nice thing about ChatGPT: if you say something and it misunderstands, you can correct it by saying, "no, actually, what I meant is ...". Which, again, is how people generally do that kind of thing outside of programming. The advantage is that you're still remaining at a much higher level of abstraction.

As far as formal languages... GPT doesn't know Lojban well, presumably because of its very small presence in the training data (and dearth of material in general). But it would be interesting to see how training on that specifically would turn out.


> Which, again, is how people generally do that kind of thing outside of programming.

Yes, and with people, that's insufficient if you really need confidence of understanding.

There's a reason that lawyers speak legalese, doctors speak medicalese, etc. These are highly structured languages to minimize confusion.

Even in less technical interactions, when you need to be sure that you understand what someone else is saying, you are taught to rephrase what they said and tell it back to them for confirmation. And there's still a large margin of error even then.

This is why, whenever I have an important conversation at work, I always send an email to the person telling them what I understood from our exchange. In part to check if I understood correctly, but also so that I have a record of the exchange to cover my ass if things go sideways because we didn't understand each other, but thought we did.


Every text interface will eventually include a half-baked implementation of a programming language.


> You can program GPT in English.

> Let me repeat that: You can program GPT in English. ENGLISH!

What does that mean to "program GPT"? Do you mean program (software) USING GPT?

I thought we already had COBOL, which is pretty much like English so business people can use it. Same for SQL.

And don't we already have lots of low-code or no-code tools? Why do we need to program with ChatGPT if we already are beyond programming?


Not the person you replied to, but I see it the same way. GPT is an English (and other natural language) compiler.

Not in the sense that you get a computer program out (though you can), but in the sense that it can automate anything without even needing a programming language, compiler, and domain specific UX.

Low code and no-code tools still require thinking like a programmer. You define what you need to do, then implement, then get results. GPT often lets you go directly from spec to results.

If the goal is programming, GPT is nothing special. If the goal is quickly reasoning over very abstract instructions, it’s amazing.

The trick is seeing the new use cases. It really does come back to the GUI revolution: if you want to list files in a directory, the CLI is just as good, maybe better. But the GUI makes Photoshop possible.

GPT makes it possible to say “summarize the status emails I sent over the past year, with one section per quarter and three bullet points per section”. And the magic is that the request itself is the programming.


> What does that mean to "program GPT"? Do you mean program (software) USING GPT?

A sibling comment already explained the second part of the question, but there is something I find more exciting. You can program GPT, as in you can tell it to change its behavior. The innumerable "jailbreak" prompts are just programs written in English which modify GPT itself. Like macros in Lisp, I guess. The first time I truly saw this potential was when someone showed me that you don't actually have to change the temperature of ChatGPT in code; you can just tell it to give low- and high-temperature answers in the prompt[1]. That's programming the model itself in English.

[1] https://news.ycombinator.com/item?id=34801574


The Ford Nucleon was a 1957 concept car that featured a compact nuclear reactor. Look at how well that prediction aged. It's apt that you mention the Trinity test, since 1950s inflated expectations of the applicability of nuclear everything are exactly where we are now.

Perhaps I could interest you in some Radium Water? It's new and trendy and good for your health.


Wikipedia has a fascinating article:

https://en.wikipedia.org/wiki/Radioactive_quackery


> Let me repeat that: You can program GPT in English. ENGLISH!

For problems that are fine being defined ambiguously. Try to program a database in English, let's see where it goes.


You laugh, but this is why SQL reads kinda-sorta like English. People have tried, and failed.

Meanwhile, if you give ChatGPT your database schema and ask it to write a SQL query for a report, it can do that for you.


If anyone wants to see the output of GPT4 when asked to define the tables and some sample queries for a hackernews clone in sqlite:

https://poe.com/lookaroundyou/1512927999932134


This is a very simple case that doesn’t reflect the complexity of a real project. Like so many earlier attempts at producing code with little effort, it degrades when the complexity level increases even slightly. Once there are more tables, with names that cannot be easily translated from English, it breaks down quickly. These types of tools work OK for brand-new projects, but work on existing projects will prove harder than it is worth.

Nonetheless, it could prove useful for looking up algorithms and patterns and for generating boilerplate code. However, an important issue is whether it will generate similar code if queried at a later time. Not likely, which will make it less useful or result in an inconsistent codebase. Maybe you can request a version of the code generator? In-house code generators generate consistent code, so it will be interesting to see how this is used in real projects.


Here's a more extreme example: using SQL as an API to give the model access to game world state to reason about.

https://gist.github.com/int19h/4f5b98bcb9fab124d308efc19e530...

Note that in this case it isn't even asked to write specific queries for specific tasks - it's just given one high-level task and the schema to work with.

You're right, though, that the effectiveness of this approach depends very much on schema design and things like descriptive table/column names etc (and even then sometimes you have to make it more explicit than a human would need). You really need to design the schema around the capabilities of the model for best results, which makes it that much harder to integrate with legacy stuff. Similarly, not all representations of data work equivalently well - originally, I gave the model direct access to the typed object graph, and it handles that much worse than SQL. So if your legacy software has a data model that is not easy to map to relational, too bad.

On the other hand, GPT-4 is already vastly better at this kind of task than GPT-3.5, so I think we can't assume that this will remain a limitation with larger models.


> You really need to design the schema around the capabilities of the model for best results, which makes it that much harder to integrate with legacy stuff.

This may end up being a feature of some high level frameworks … “compatible with ChatGPT” or “designed to work with xxx LLM”.


It will be very amusing if, eventually, our jobs as software engineers become crafting bespoke APIs to maximize the efficiency of their use by an LLM.


> and then I type 3 keywords from their massive prompt they crafted and find it instantly on Google.

It seems you and I have different Googles, and you still have the one I had pre-2010.

For over a decade now, Google has been including things I never asked about, to the point where it would sometimes be easier to find things using Marginalia.

Some say it is just because the internet has changed and there is less ham and more spam, but for the last few months I have been using Kagi, and it proves it is possible to create a better search experience.

And, if Google works for you, fine. Maybe you search other topics, use other keywords or are in another bucket wrt experiments, but from my perspective Google is now the same as its predecessors.


I actually agree Google has gone downhill. Yet for the 8 or so examples I’ve tested where I saw hyped GPT results, Google answers every single one, usually in the top snippet, always in the first result.

For politics, shopping, and some other topics it can be terrible, but I don’t think GPT is good at those either.

I’m actually happy to be proven wrong here. If you have some examples let’s test it out. If it’s a true step function improvement I’d expect it to be easy to source examples.


Haven't used Google in a while but let me try.


I think this is a classic case of us overestimating the immediate impact and underestimating the long term impact.

Right now, they are definitely useful time savers, but they need a lot of handholding. Eventually, someone will figure out how to get hundreds of LLMs supervising teams of millions of LLMs to do some really wild stuff that is currently completely impossible.

You could spin up a giant staff the way we do servers now. There has to be a world changing application of that.


I’m not in ML, so excuse this maybe-naive question:

> get hundreds of LLMs supervising teams of millions of LLMs

What does this mean or what can you do with this setup… do you mean running LLMs in parallel?


Yes, that's called 'ensembling'. There is a lot of work being done on this kind of solution. One way it could work is that you use multiple models that have been fine-tuned for various problems, and then use the answer that returns the highest confidence.
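A minimal sketch of that highest-confidence ensembling. The "models" here are stand-in functions returning an answer plus a confidence score; a real system would call several fine-tuned LLMs and derive confidence from, e.g., token log-probabilities.

```python
def ensemble(models, prompt):
    """Ask every model, return the answer whose confidence is highest."""
    candidates = [model(prompt) for model in models]
    best_answer, _best_conf = max(candidates, key=lambda pair: pair[1])
    return best_answer

# Illustrative stand-ins for fine-tuned specialists:
def sql_specialist(prompt):
    return ("SELECT ... (sql-tuned answer)", 0.9)

def code_specialist(prompt):
    return ("def ... (code-tuned answer)", 0.4)
```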


You can also have adversarial generation, where models given different expertises and attitudes go back and forth criticizing each other's work.


Sounds like the 'dead internet' is just around the corner!


Ask the LLM to perform a complex task by splitting it into sub tasks to be performed by other LLM instances, then integrate the results...
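As a sketch of that split-delegate-integrate pattern, with `ask` standing in for a call to one LLM instance (the prompt wording is made up for illustration):

```python
def supervise(task, ask):
    """Decompose `task`, run each sub-task through its own call, integrate."""
    plan = ask(f"Split this task into sub-tasks, one per line: {task}")
    results = [ask(f"Do this sub-task: {sub}") for sub in plan.splitlines()]
    return ask("Combine these results into one answer:\n" + "\n".join(results))
```

Agent frameworks like Auto-GPT, mentioned elsewhere in the thread, elaborate on this loop with memory and tool use added.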


Is this something like what langchain is working towards?


AutoGPT, BabyAGI


We call that “companies”. We just need to apply what we learned in business school to a different set of workers, slightly deficient workers.


> We just need to apply what we learned in business school

Please don't. You've already ruined enough industries. Let the MBAs do finance and Wall Street and leave them out of the chain of command in organizations that make things.


Every time you go to the store and find that the store is still in business and there is food on the shelf, it is because someone went to business school and knows how to optimize demand estimation, pricing, and logistics.

Yes, some MBAs fuck things up. Just like some CS grads fuck things up. But advocating against the study of business is just as naive as advocating against the study of computer science just because there are some bad CS grads.


> Every time you go to the store and find that the store is still in business and there is food on the shelf, it is because someone went to business school

Are you contending that businesses were not successful before Wharton started pumping out MBAs?

> But advocating against the study of business is just as naive as advocating against the study of computer science

I didn't say 'don't study business', I said 'stick to finance'. MBAs tend to end up destroying innovation and productivity for short term growth and stats.

Jack Welch showed what a successfully motivated 'business oriented' leader can do to an innovative and productive legacy organization when given complete control over it. The MBAs happen to just do it on a smaller scale.


> advocating against the study of business is just as naive as advocating against the study of computer science just because there are some bad CS grads.

Criticizing garbage MBA programs is not criticizing the study of business. Business schools don't study business. They're a place where people make a lot of money selling theories about business that are useless at best and, in many places, quite harmful. Learning about business by going to business school is like learning to kiss by reading books about kissing.


That is a great analogy.

I would say that just as every person is unique, so is every company. And just as there is plenty of pseudoscience plaguing psychology, so are MBAs full of pseudoscience. Both fields are far too obsessed with generalizing their advice. Which is not to say that there aren't any useful ideas in them. But the vitriolic reaction above is warranted.


Stores existed before the MBA, but MBAs could be why food prices are up 30% since last year.


Take shelter under my protective wings, O, sweet summer children.


>Eventually, someone will figure out how to get hundreds of LLMs supervising teams of millions of LLMs to do some really wild stuff that is currently completely impossible.

This is an intuitive direction. In fact, it’s so intuitive that it’s a little bit odd that nobody seems to have made proper progress with LLM swarm computation.


I've read about people doing it, I haven't read about people achieving anything particularly interesting with it.

It's early days. There will be a GPT 5 I'm sure, maybe that one will be better at teamwork.


This sounds like that old economics joke that says it's impossible to find $20 on the ground, because if it had been there, someone would have already picked it up.


In particular, it's odd that the greatest software developer in the world (ChatGPT) hasn't made progress with LLM swarm computation.


How is "LLM swarm computation" different from a single bigger LLM?


The same reason why you don't let Mr Musk do all the work. He can't.

One LLM is limited, one obvious limitation is its context window. Using a swarm of LLMs that each do a little task can alleviate that.

We do it too and it's called delegation.

Edit: BTW, "swarm" is meaningless with LLMs. It can be the same instance, but prompted differently each time.


> The same reason why you don't let Mr Musk do all the work. He can't.

Better to limit his incompetence to one position.


I beg to differ. Imagine him taking down Twitter, Facebook, Instagram, and all the others in one fell swoop!


Context window is a limitation, but have we actually hit the ceiling on scaling it? For GPT, you need O(N^2) VRAM to handle larger context sizes, but that is ultimately an "I need more hardware" problem; as I understand it, the reason they don't go higher is the economic viability, not that it couldn't be done in principle. And there are many interesting hardware developments in the pipeline now that engineers know exactly what kind of compute they can narrowly optimize for.

So, perhaps, there aren't swarms yet just because there are easier ways to scale for now?
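For a sense of scale on the O(N^2) claim: naive attention stores one score per (query, key) pair, so the score matrix alone quadruples when the context doubles. The numbers below are illustrative only (2-byte fp16 scores, a single matrix); real implementations vary, and techniques like FlashAttention avoid materializing the matrix at all.

```python
def attention_matrix_bytes(context_len, bytes_per_score=2):
    """Memory for one N x N attention score matrix."""
    return context_len * context_len * bytes_per_score

for n in (8_192, 32_768, 131_072):
    print(f"{n:>7} tokens -> {attention_matrix_bytes(n) / 2**30:.2f} GiB per matrix")
```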


I am sure the context window can go up, maybe into the MB range. But I still see delegation as a necessary part of the solution.

For the same reason one genius human does not suddenly need less support staff, they actually need more.

Edit: and why it isn’t here yet is because it’s new and hard.


It's easy to distribute across many computers which communicate with high latency


LLMs are already running distributed on swarms of computers. A swarm of swarms is just a bigger swarm.

So again, what is the actual difference you are imagining?

Or is it just that distributed X is fashionable?


Rather large parts of your brain are generalized, but in particular places we have more specialized areas. Looking at it, you would most likely consider it all the same brain, but from a systems-thinking view, a specialized area is a small separate brain with a slightly different task than the rest of the brain.

If 80% of the processors in a cluster are running 'general LLM' and 20% are running 'math LLM' are they the same cluster? Could you host the cluster in a different data center? What if you want to test different math LLM modules out with the general intelligence?


I think I would consider them split when the different modules are interchangeable so there is de facto an interface.

In the case of the brain, while certain functional regions are highly specialized I would not consider them "a small separate brain". Functional regions are not sub-organs.


Significantly higher latency than you have within a single datacenter. Think "my GPU working with your GPU".


There are already LLMs hosted across the internet (Folding@Home style) instead of in a single data center.

Just because the swarm infrastructure hosting an LLM has higher latency across certain paths does not make it a swarm of LLMs.


> There are already LLMs hosted across the internet (Folding@Home style)

Interesting, I haven't heard of that. Can you name examples?


I read about Petals (1) some time ago here on HN. There are surely others too, but I don't remember the names.

1. https://github.com/bigscience-workshop/petals


Hype bubbles have been around for hundreds of years, and saying “hype bubble” doesn’t make ChatGPT worth any less. I have definitely been surprised by this, and I’ve got to say I’m expecting AGI a lot faster now. Even if literally all it did was predict what the average internet user would write in a certain context, that’s huge, because when you integrate all the little advantages of all the weird things one person knows and another doesn’t, the collective knowledge is worth more than the sum of the parts. It’s a tool that can tap into the sum total of human knowledge 24/7, faster than I can come up with questions for it.

Mainly I’m just excited to play with larger-context models so I can include more code and get big-picture ideas about groups of things too numerous for my feeble meat brain to reason about. 7-9 things in working memory has always been the limit that would make humans inferior to AI in the long run. Even if it’s not that insanely smart (though realize: intelligence is a probabilistic concept, and computers are great at multiplying probabilities precisely), if the thing can fit more in memory than us, type faster than us, and never gets tired or overwhelmed and gives up (imagine your capability in a world where you had no tiredness and unlimited self-discipline), in time it’s inevitable that the transformers put us all to shame. And the more complicated the topic, the bigger the shaming, since more complicated topics have exponentially more relations to reason about.

Who’s going to trust a human doctor to diagnose their stuff if the human brain holds 9 things and the AI holds thousands?


"Who’s gonna trust a human doctor to diagnose their stuff if the human brain holds 9 things and the AI holds thousands?"

The human brain can hold much more than 9 things, and even though AI will be used broadly in medicine very soon, I really want the final diagnosis done by a human.

Once true AGI arrives, I might change my opinion, but that might take a while.


9 things is considered a standard for working memory (kind of like processor registers), for people with ADHD it's even less - 3-5.

Try writing a number from one piece of paper to another. If it's more than 7-9 digits, you won't do it in one shot unless you spend extra time memorizing it.


That can be increased quite a bit with practice. But it's also not important. It's just the cache memory -- it isn't the limit of what can be learned and recalled.


It is a limit on what you can reason about without a piece of paper.

I’m proficient at math, but my working memory is around 6, so I cannot add two three digit numbers to each other in my head (unless I see numbers to be added in front of me).


OK, but I and nearly all of my friends can, so we have duelling anecdotes here.


The equivalent for computers would be L1 cache on the CPU which is tiny.


More like cpu registers I would say :)


> AI will be used in medicine broadly very soon

We have been hearing this since forever.

Revolutions do happen, but not the way we expect. My anecdotal experience: no one on my team of about 30 people developing software uses ChatGPT or similar in their day-to-day. This may change, or not.


AI is being used in medicine already. For example, in diagnostics. Most new diagnostics devices (e.g. CT scan, cardiograms) include AI systems that suggest an interpretation and point towards possible problems that a doctor might occasionally miss.

Granted, currently deployed systems are mostly awful, way behind the state of the art, and therefore mostly useless. Maybe it's because designing medical devices and getting them approved takes so long. Maybe it's because the manufacturers put AI in there for marketing purposes only, while assuming nobody will use the suggestions anyway. In any case, I strongly expect the trend to continue and these systems to become very useful quite soon.


> Who’s gonna trust a human doctor to diagnose their stuff if the human brain holds 9 things and the AI holds thousands?

I will. As another commenter says, the brain isn't limited to 9 things at all. There's no way that I'll trust the diagnosis of a machine that won't understand me.

If a doctor uses AI to help with research, that would be OK. Just so long as the doctor is actually the one doing the diagnosis and prescribing the treatment.


The difference between your search query and theirs is clearly the level of expertise. ChatGPT has a great use case when you're getting started on a new subject; even with a very clunky description it can point you to the core concepts of any field. Instead of reading 10 papers which are somewhat related but not what you're looking for, you can spend 3 minutes writing clumsy prompts and that's about it :)


Exactly. Does anybody remember in the early internet days people laughing at their parents for googling things like "please help with my back pain my doctor sucks".

Well, who's laughing now?


To add to this, using ChatGPT feels great in the moment, because it seems to work so well. For example, asking it for an itinerary while traveling gives you something that looks great.

However, once you actually start using it and see that the "ten minute walk" is actually an hour walk, or that a full third of the attractions it has shepherded you to are permanently closed, you realize that building that itinerary yourself from scratch using Google or TripAdvisor would take you less time than manually double checking everything ChatGPT says.

It's also quite surprising that people still think ChatGPT is capable of logic. Even for a complete layperson, all it takes is asking it to draw someone's family tree as an ASCII chart to see that text prediction only goes so far and there's not enough of a relational concept in there to comprise knowledge. There are many examples of asking it to solve famous puzzles with minor variations where it fails spectacularly.

The marketing behind ChatGPT is genius, but there is only so far you can go before the honeymoon is over and people start to really question what you brought to the table. Aside from that, ChatGPT isn't unique in what it can do, and others (including open source) are catching up fast.

That being said, I'd still use it for something like language learning (and other types of learning), where follow up queries (such as why you'd use one word instead of another, or how to rephrase something to be more polite) unlock a significant amount of value. It can also be useful to write trivial code, though I doubt a serious professional would do this (for several reasons, such as privacy and liability). Ultimately, ChatGPT fits squarely under "tool" and not under "intelligence".

It seems that as of right now, the killer app of ChatGPT is the boost in views you get by putting it in the title of your YouTube video.


As for googling, here are some examples of queries you can try and see how it works:

- summary of all the carbon neutral concrete methods, especially ones that can be done in a small industrial workshop as a prototype

- I have allergies in Thailand, mid-february. What may it be related to?

- list all the companies from Japanese stock exchange that have high debt rate

Those are top of my head, but really anything that is either a super-specific niche, or requires merging a few niches together, Google won't help you with.


How do you deal with it straight up lying? My problem with this whole system is, if I’m asking those questions it’s because I don’t understand the field well enough to answer it myself, which means I can’t pick up on if ChatGPT is lying…


Fair, but not completely true. The Thailand example gives detailed reasoning. You can use those building blocks to check. If it says Thailand is a cold country and uses that in its argument, it's shaky. You don't have to be an expert climatologist to make this judgement.

It's not just one clean answer and we're done. In my experience it is helpful in breaking the problem down into stuff you can Google.


> In my experience it is helpful in breaking the problem down into stuff you can Google.

Yeah I can see that being useful. I’ve also seen a lot of non-technical people straight up accept whatever comes out of it, so that’s a little worrying. It’s true of Google searches too, of course, but at least a google search gives N results someone can check rather than 1.


They straight up accept until they discover first mistakes :)


Fact check with google.

With the example questions I provided, it would take many hours to do research on the subject. GPT provided initial answers instantly, and then fact checking was easy.

That’s what we did with gpt-3. With plugins you can have gpt fact-check itself.

Also, if you have a system for dedicated knowledge, you can use embeddings - with embeddings gpt has very little room for hallucinations, and it can provide detailed references.
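The embeddings setup described above can be sketched concretely. This is a minimal, self-contained illustration, not any particular vendor's API: the 3-d vectors are hand-made stand-ins for real embedding-model output, and the prompt wording is an assumption. The idea is to retrieve the most similar known snippets and instruct the model to answer only from them, which leaves far less room for hallucination.

```python
import math

# Toy knowledge base. In practice each entry's vector would come from an
# embedding model; here the 3-d vectors are hand-made stand-ins.
KNOWLEDGE = [
    ("Refunds are issued within 14 days of purchase.", [0.9, 0.1, 0.0]),
    ("Support is available Monday through Friday.",    [0.1, 0.9, 0.0]),
    ("Shipping abroad takes 5-7 business days.",       [0.0, 0.2, 0.9]),
]

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def retrieve(query_vec, k=1):
    """Return the k knowledge snippets closest to the query vector."""
    ranked = sorted(KNOWLEDGE, key=lambda item: cosine(query_vec, item[1]),
                    reverse=True)
    return [text for text, _ in ranked[:k]]

def build_prompt(question, query_vec):
    """Ground the model: answer only from the retrieved snippets."""
    context = "\n".join(retrieve(query_vec))
    return ("Answer using ONLY the context below; say 'I don't know' otherwise.\n"
            f"Context:\n{context}\n\nQuestion: {question}")
```

A question whose embedding lands near the refunds entry pulls that snippet into the prompt, so the model is steered toward citing known text rather than inventing an answer.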


For all of those, you HAVE to be okay with a complete garbage answer. Are you?


A machine generating confident bullshit will be the perfect companion for con-men to partially automate their workflow.

Humans are really gullible for the appearance of confidence. And humans are also very prone to wishful thinking.


> Taking into account the downsides it looks like a hype bubble right now to me, and a draw in the long run. There’s just a whole lot of tech people trying to cash in on the hype.

Techies will realize that they are just giving ideas to O̶p̶e̶n̶AI.com, Microsoft Word, Google Docs and Notion. It's just the same AI bros re-selling their hallucinating snake-oil chatbots under a new AI narrative.

There is a reason why the only safe serious use-case of LLMs is summarization of existing text, since everything else it does is untrustworthy and is complete bullshit.

Their so-called 'revolution' is a grift.


English speakers tend to miss a huge use case: Translation.


I do wonder where LLM translator would take us to, considering that Japanese version of Bing Image Creator[0] is still proudly displaying a complete nonsense…

0: https://www.bing.com/create

    作成 *芸術* 開始日
    AI を使用した単語

(Roughly: "Create *art* start date" / "words that used AI", i.e. word salad in Japanese.)


DeepL.com? You don't need a general-purpose LLM to do translation.


GPT-4 is a much better translator than Deepl especially for far apart languages.


A 'use-case' that LLMs do worse, especially on reliability. Translation is already done without a hallucinating LLM, and it can be done offline.

Summarization of existing text is the *only* safe and serious use-case for LLMs.


How is summarization "safe"? The summary might be wrong just as well.

The use-case is anything where occasional bullshit output is an acceptable trade-off for the speed-up. More reliable outputs will enable more use-cases.


And what is a business that is fine with occasional (or frequent) bullshit output? Fake-news and spam.


Every business is fine with some frequency of bullshit output at some level. The question is how often exactly it happens and how much harm the bullshit can cause.


My point was that spam is the perfect use case for this tech. Of course there are other possible use cases, but spam and fake news content creation are the perfect fit. AI will enable one to easily clone the writing style of any publication and insert whatever bullshit content and keep up with the publishing cycle with almost zero workforce.

Want a flat-earther version of New York Times (The New York Flat Times)? Done. Want a just slightly insidiously fascist version of NPR? Done. Want a pro-Nato version of RussiaToday (WestRussiaToday)? Done.

And we already know people share stuff without checking for veracity and reliability first.


Machine translation is already unreliable even without LLMs, so that's not a weak point specific to LLM translation.


GPT-4 translates much better than anything else out there, esp. when it comes to idioms and manner of speech.


Notion going all-in on the "AI" stuff is annoying/concerning to me. Mostly just that I live and die by a personal Notion wiki to keep my life organized, and if they eventually tank their service by investing too many resources into features that don't take off and I have to find a new tool to offload my brain into, I'm gonna be pissed...


I went with Logseq and for the first time in a number of years (actually since OneNote 2016, the last self hosted version) I am actually happy with my tooling again.

It doesn't cover everything OneNote 2016 did, but it does a lot more in other areas and it is progressing nicely.


This looks great! Do they have a (decent) mobile app? Being able to jot stuff down whenever, wherever is a make-or-break feature for me...


The app is already usable, at least on iOS, but for now sync is a bit rough around the edges; i.e., I need to verify it has synced or it will overwrite, and I have to fix it using the page history, which thankfully exists.


> Taking into account the downsides it looks like a hype bubble right now to me.

100% agree and so glad to see someone else say it. I feel like people are losing their minds every time we go through the same hamster wheel.

To hear first hand, in the article above, the effect this is having on ML engineers breaks my heart.


LLMs are a tool that understands intent. That makes it super easy to compose APIs into an agent and give it a task https://python.langchain.com/en/latest/modules/agents/agents...

That is just the introduction, showcasing the level of sophistication you get with just Google and Wikipedia as tools.

Now imagine TaskRabbit or Fiverr as tools. AI can make things happen in the real world.

These LLMs have limited attention but infinite focus. You can parallelize them, you can have one direct a fleet of other LLMs, you can have LLMs checking inputs and outputs for correctness from the other models and feed that information back to the controlling model so that it can improve the prompt to the others as it tries to reach its goal.

And the goal can be far-reaching (manufacture fake artsy trinkets and import them from China to distribute on Etsy) or nefarious (produce subtle propaganda on a multitude of WordPress websites, register accounts on Wikipedia and Reddit, create a sophisticated network of citations).
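The agent pattern described here can be sketched without any framework: a loop in which the model chooses a tool, the tool runs, and the observation is fed back into the transcript. In this sketch `fake_llm` is a hard-coded stand-in for a real model call, and both tools are toys; only the loop structure reflects how these agents actually work.

```python
# Minimal tool-using agent loop: the "model" (stubbed here) picks a tool,
# the tool's output is appended to the transcript, and the loop repeats
# until the model emits a final answer.

def search_tool(query):
    # Stand-in for a real search API call.
    return "Riga is the capital of Latvia."

def calculator_tool(expr):
    # Toy evaluator; never eval untrusted input in real code.
    return str(eval(expr, {"__builtins__": {}}))

TOOLS = {"search": search_tool, "calc": calculator_tool}

def fake_llm(transcript):
    """Stand-in policy: a real LLM would read the transcript and decide."""
    if "Riga" not in transcript:
        return ("tool", "search", "capital of Latvia")
    return ("final", "The capital of Latvia is Riga.", None)

def run_agent(task, max_steps=5):
    transcript = f"Task: {task}"
    for _ in range(max_steps):
        kind, arg1, arg2 = fake_llm(transcript)
        if kind == "final":
            return arg1
        observation = TOOLS[arg1](arg2)          # run the chosen tool
        transcript += f"\n[{arg1}] {observation}"  # feed result back in
    return "gave up"
```

The controlling-model and checker-model variants mentioned above are just more instances of the same loop, with one `run_agent` feeding transcripts to another.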


> OpenAI revolutionized… rewriting things with slightly different wording?

Yes, if you try hard enough, you can cast transformational shifts as trifling.

- e.g. “Bardeen, Shockley, and Brattain made a smaller version of the vacuum tube.” (transistors)

- “Scientists discovered that light could carry information, like electrical wires do.” (fiber-optics)

The effects (including the harder to measure cultural shifts) matter more than some uncharitable characterization.

Also, the “it is not X” thinking is the result of present fixation. Such argumentation is, at best, quite narrow. Perhaps applicable in specific defined markets and situations but hardly a good mindset for making sense of how the world is changing. Hence the cliché, “The Stone Age didn’t end because we ran out of stone.”

The psychological undertones in the comment above are probably “people, stop exaggerating”. From one overreaction to another, it seems.


The popularity of ChatGPT revolutionized how we spend time: on learning, on many kinds of busywork (it's redefining what is and isn't "busywork"), on planning. And most important: we don't know what we have yet because it's still being built. It's a tool. It's not the "capabilities", it's what people get out of it.

You mention blogging from the standpoint of writing it all yourself, and then using a tool to tweak it. That's not the revolutionary part. It's collaborating with the tool to write the post.


I need features extracted from an article description, and ChatGPT can just do it without any effort.

I don't know any other library I could use for this task with nearly the same quality, besides some regex soup.


You act like writing itself doesn't take time and energy. It has sped up my grant writing 6 fold. Any long-ish form writing that I need to do now happens at warp speed


If we focus on just this part: it means most content creators no longer need to convert or reproduce their content to fit another group of customers.


That's very cool, and right now it's a good idea, but I strongly suspect GPT only looks clever and does a good job in isolation. If everyone starts using it product announcements will start looking very similar, and they'll lose a lot of their impact.

This is definitely the case with cover letters for job applications. The ones written by GPT appear to be pretty obvious. My guesses could be wrong, but after years of most applications not having a cover letter, most applications have had one over the past few months, and there's a distinct 'style' showing up, so I suspect GPT is involved.

Using GPT could be the 'bootstrap ui' of product announcements. It looks great on its own, but put it next to a bunch of other companies and they all fail to stand out.


Question for you: given two identical candidates, one who does not submit a cover letter and one who submits a cover letter that was clearly written by ChatGPT, which candidate would you rather interview?

On one hand, the duplicity involved in submitting a ChatGPT cover letter seems concerning in a new hire. On the other hand, it shows resourcefulness and going above and beyond.

I’m tempted to say I’d prefer the gpt cover letter candidate, simply to talk to them about how they got the idea and how they executed, but I’m curious if you feel the same way.


Right now I'd also have a preference for the GPT candidate because it shows a bit of interest in new tech. In a year, when 50% of candidates submit GPT cover letters, I think I'll see it as a sign of laziness or of trying to hide poor comms skills. Maybe not, though. Time will tell.


I'd probably roundfile the chatGPT one. A cover letter needs to be a personal communication from the candidate, not something machine-generated. The writing style is an important signal.

However, I would have zero issue if they used chatgpt to help compose their CV.


I find it interesting that it's very hard to make it do typos, the kind of errors humans make. It can do typos, but even those seem weird. If I push it to make errors, it either over- or undershoots.

The "correctness" of it is a definite giveaway, IMO.


In fairness, lack of typos in a cover letter has traditionally been interpreted as a sign of diligence!

For more general queries its "house style" tends to be really obvious, with all its "however, it's important to note"s and "ultimately it depends on"s, and the tendency to flesh out a one-sentence answer to the specific question with two paragraphs or five bullet points of detail at a slight tangent to it...


It can imitate the style of little kids with their phonetic spelling (though this is common in English, French, etc., but not in Italian or Turkish, where spelling is very regular).

I'm sure it can at least make very common mistakes like "it's" vs "its" or "would of" if prompted right, since there's a huge body of that kind of writing. Or maybe a human needs to add the finishing touches to make it look more human. :)


That distinct ChatGPT style is mostly the product of their RLHF, so it's what you get by default if you don't ask for something more specific. But it's fairly easy to tweak the prompt to make it use whatever style you want, including more terse, less apologetic etc. Don't forget that "write about X in the style of Y" was one of the first things that GPT models could reliably do, long before chat.


Hmm…no, I’m not doubting the value of the interface. I’m asking what’s the “killer app”.

Maybe I should just ask ChatGPT to explain it to me…


It seems like the use of ChatGPT is something like "microtasks": little things a given person could do but would rather not, and so is able to delegate to an automatic thing whose output they can verify.

Its potential as of today is increasing (or seeming to increase) the productivity of a segment of white collar workers, in the fashion that email and the web did (or might not have). A lot of researchers might not have need for this and so not understand the appeal of it.


Thanks for your comment, I think this is actually very helpful framing! It at least gives me something more tangible to think about—“micro tasks”.


I, too, have been experimenting a bit to see if/how an LLM might help me. This microtask framing jibes with my experience.

One of the best examples so far for me (and it's truly micro) was at the grocery store. A friend was trying to figure out how big a rice bag to get, one she could finish before a long trip coming up. She knew she ate a couple of cups dry a week.

"I eat 2 cups dry rice per week. Can I finish a 25lb bag in less than 4 months?" "Yes" (it did show its work).

One shot, perfect response. I know this kind of computational thing is what WolframAlpha was for, but that wasn't nearly as reliable. I know I could figure it out myself, but I'd need to find a reasonable figure for the density of rice and probably do some imperial metric conversions and generally futz around for longer than one would want to stand in front of a pallet of rice bags.


50 cups of rice in a 25 lb bag, 4 months = 16 weeks = 32 cups at 2 cups / week. I think there will be some rice left.

Maybe this is one of those examples where this tool gives a confident and wrong answer even when it shows its homework?
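The arithmetic is easy to reproduce directly (assuming the common figure of roughly half a pound per dry cup of rice, which is where the "50 cups" estimate comes from):

```python
# Sanity-check the rice question: can 2 dry cups/week finish a 25 lb bag
# in 4 months (~16 weeks)? Assumes ~0.5 lb per dry cup of rice.
LB_PER_DRY_CUP = 0.5              # approximate density assumption
bag_cups = 25 / LB_PER_DRY_CUP    # cups in the bag
eaten_cups = 2 * 16               # 2 cups/week for ~16 weeks
leftover = bag_cups - eaten_cups
print(bag_cups, eaten_cups, leftover)  # 50.0 32 18.0 -> bag is NOT finished
```

So under that assumption the answer should have been "no", with about 18 cups left over, which is why a confident "yes" with shown work is still worth checking.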


Honestly this sounds more like a place for GPT/Wolfram API than just LLM alone.


Really? You're using ChatGPT to do basic math you could easily do in your head?


If we're characterizing LLMs as a kind of input interface I'd point out that the first GUI was released in 1973 by Xerox and the first commercially successful GUI was released in 1984 with the Mac 128K. It took 11 years for someone to answer this question. Sure things move faster these days, but we're still only a couple of months in.

With blockchains there was also a fundamental technological breakthrough (I'd argue less revolutionary than LLMs). The problem was that everyone jumped the gun and proclaimed the killer app had been discovered too soon: cryptocurrency's incarnations to date have yet to demonstrate much utility apart from being a vehicle for speculation. Nakamoto invented the first distributed blockchain in 2008...


It’s programming. The thing is really great at programming, and everything else is icing on the cake. They already got my $20 a month lol


Anecdotally, I've had more writers (screenwriters and copywriters) tell me they're using ChatGPT than programmers. I think people here might underestimate how big a deal it is in "the real world".


Not really programming, but generally producing text (or maybe, at some point, generating structured output like ASTs). Programming is a subset.

I expect that this will move lots of things into the world of text that weren't there before.


Bingo. No programmer seems to want to program without it anymore. Sounds like a killer app to me.


Most of the economy involves moving around physical things. Construction, transport, nursing and related health occupations (physiotherapy, home aid, etc.), retail and wholesale logistics. Manufacturing employs another chunk. Services for agriculture, fishing, mining also employ more people than are directly employed in those industries and are mostly to do with machines and equipment. Utilities.

Most of the rest is high-touch. People want interactions with humans for important stuff, not with AI. Remote teaching was an unmitigated disaster for most school students: how will AI teachers do, do you think? Attempts at robot police and security guards haven't gone down very well to date. It'll be a while before there are AI EMTs and firemen.

So there are grounds for skepticism.


Well clearly you’re going to be sitting on the sidelines for the next few years.


I guess if you don’t like writing, this is good for you. However, I like writing. Moreover, many people are lousy at it (not that I couldn’t improve). I’m not sure I want something trained on lots of mediocre writing doing editing for me.


LLMs are trained on language, not mediocre language. This is why models can be fine-tuned in one language and then show the benefits in other languages. How much longer will this fundamental misunderstanding of these models continue, and how often will it be put forth by people who are worried about a task X they enjoy being replaced?


> What am I missing?

I'm going to guess (assume) you probably haven't worked in a 'real' business. A place where elbow grease still does the majority of the work and where Windows 8 was only just phased out.

The killer app (to me) in case of GPT is GPT itself, not ChatGPT. ChatGPT just allows me to easily test use cases for GPT. There are many interesting use cases for those elbow grease businesses for GPT. For example:

Data entry. There's still a lot of data entry being done from unstructured text. Where specifics like names and addresses need to be extracted from letters and emails and contracts and other stuff. I've worked on these challenges before using different strategies and GPT blows my mind with what it can do just by asking to grab this data and format it as JSON. Is it 100% correct? Nope. You still need people to review the data (depending on the use case), but that already saves tons of work.
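For illustration, a hedged sketch of what that extraction flow can look like. The prompt wording, field names, and canned model reply are all hypothetical; the point is instructing the model to emit JSON and then validating the output instead of trusting it blindly, so bad extractions get flagged for the human reviewers mentioned above.

```python
import json

REQUIRED_FIELDS = {"name", "address", "email"}

def build_extraction_prompt(letter_text):
    """Hypothetical prompt; the fixed-JSON-schema instruction is the key part."""
    return (
        "Extract the sender's name, address, and email from the text below. "
        'Reply with ONLY a JSON object with keys "name", "address", "email", '
        "using null for anything missing.\n\n" + letter_text
    )

def parse_model_reply(reply):
    """Validate the model's output; return None to flag for human review."""
    try:
        record = json.loads(reply)
    except json.JSONDecodeError:
        return None                      # not valid JSON at all
    if not REQUIRED_FIELDS.issubset(record):
        return None                      # missing an expected key
    return record

# Canned reply standing in for a real API response:
reply = '{"name": "J. Berzins", "address": "Riga, Latvia", "email": null}'
record = parse_model_reply(reply)
```

Anything that comes back `None` goes to the manual-review queue; everything else has at least the right shape before a person double-checks the values.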

Categorization. Some companies still get tons of emails that need to be forwarded to specific departments. This is another thing that GPT does surprisingly well out of the box.

And that's just GPT. There are many other legacy business processes that can be automated (partially) by other models that are coming out right now. Even just 'segment anything' that Meta just released is incredibly useful for many use cases I've seen in my daily work.

Killer apps are always a combination of a technology to solve a real world problem. If you don't venture into the real world and only stay part of the tech world, seeing the killer app is very difficult and ends up leading to Juicero-like products.


Right, questioning the value of a chatbot is a good indication someone hasn't had a real job. Totally legit characterization that doesn't make the whole thing sound like a confidence game.


> Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize. Assume good faith.

https://news.ycombinator.com/newsguidelines.html

> questioning the value of a chatbot

Original commenter was questioning the value of the entire field.

> hasn’t had a real job”

This is different than “haven't worked in a 'real' business”.


Ironic, given your and others' emphasis on my choosing "chatbot" to summarise LLMs as a reason to dismiss my comment, along with the rest of the pedantry. The upstream post dismissed / insulted the person for questioning value, which was what I called out.

If you really had wanted to get into the "HN rules" game, you could at least have cited "don't be snarky"


> dismissed / insulted the person

That wasn't even remotely my goal and I'm disappointed that my choice of words made it seem like it was. I purposefully added quotes to the word 'real' in my comment since any business is a real business and made it clear it was an assumption, not a fact.

It's just that many tech workers often haven't worked outside of tech and therefore are blind to issues outside of the tech world, like manual data entry, because they assume that must all be automated. It's exactly the same the other way around: people in what I called 'real' businesses are blind to what tech can do to improve business processes, because they have no clue about what's available and possible.


Thanks for putting in the effort to clarify. I get it.

Translation: “Real” businesses sort of make their own gravity.


Fair enough, thanks for clarifying.


You weakened the argument by portraying it as more insulting against a narrower claim than it was.

But yes, also snarky.


Real Job vs. outdated job.

You don't want to know how many companies still get paper bills, scan them, and add them to their systems half manually.

And normal people without scripting experience never had the tools to do even a little bit of text analysis before tools like ChatGPT.


They weren't questioning the value of the chatbot, they were questioning the value of the models based only on the high-profile use cases, such as a now (in)famous chatbot. That, to me, showed that they have a narrow view of the problems this stuff could solve when applied outside of those high-profile use cases. So I made an (explicit) assumption that they mostly worked in tech and not outside of it, which limits one's view of the world outside of tech.


I don't think donkeyd was trying to insult the person he was replying to, but he definitely could have worded that better. I think he's using "real job" in a derogatory sense directed at businesses, how most of them are so inefficient and rote that they'd stand to gain from the roteness of something like GPT, and that the person he's replying to, perhaps fortunately, hasn't experienced that type of business.


They mean they haven't had a job where neither AI nor bad workers should be used.


Data entry is a really interesting one to me. We’ve been replacing a legacy (read > 1 MLoC with no tests), system on and off for a few years. The original system had a ton of double entry or manual data entry and the human error rate is noticeable. If GPT could have automated this with a similar or reduced error rate then we would have considered it a win.

The real win long term remains killing off manual entry any time it’s possible, but GPT offers a nice patch.


> There's still a lot of data entry being done from unstructured text. Where specifics like names and addresses need to be extracted from letters and emails and contracts and other stuff. I've worked on these challenges before using different strategies and GPT blows my mind with what it can do just by asking to grab this data and format it as JSON.

Is this data confidential or something you are willing to send to anyone? If the former, you probably shouldn't be sending it to an AI company that retains the data for its own purposes.


This is very confidential data, which is why the current implementation is run on-premise and of course I'm definitely not using production data to test GPT capabilities.


That's good! There have recently been news of companies leaking trade secrets and confidential data by sharing it with ChatGPT: https://www.techradar.com/news/samsung-workers-leaked-compan...


Well, in the organization I was doing this, they had similar issues before I started, with external consultants doing stupid stuff. Luckily, my team wasn't as stupid as these people and integrity was also very high. Which was very, very necessary considering the data we were working with.


Nice call about "Categorization". I would put a lot of "Data Entry" in the same category. Excellent ideas!


Latvian is a language, that's spoken by less than 2 million people. It's irrelevant outside of Latvia.

All of our government funded researchers who worked on natural language processing can now throw their work in the trash and resign. ChatGPT is leagues better than anything that they've done. And OpenAI weren't even trying.

Windows wasn't even localised in Latvian properly until very recently. Google translate still spits out ridiculous translations. (Though it's better than before). Most software isn't even available in Latvian. Almost no video games are in Latvian. Only the most popular books and movies are being translated. Interested in something less popular - you better learn other languages.

And now ChatGPT comes out and I can ask it to write C++ functions in Latvian. I don't need to learn English to be a programmer any more. Nothing like that has been done before. It translates stuff way better than google. And it will only get better.

Imagine that there's a book that only 50 people from Latvia are interested in. Human translators aren't going to bother. But ChatGPT can do it easily.

This is a very big deal.


Why is Latvian NLP research in the trash?

Real researchers build on advances, not quit because of them. I'm sure GPT is not as optimised as it could be for processing non-English text. There's clearly a lot of work to do. At least this is true for South Asian languages, and I'm sure it's true even for popular Western languages like French or German.


Most pre-GPT NLP research is obsolete for all languages, but this was understood before ChatGPT.


This also works in reverse, BTW - think of all the obscure books, movies etc that were never translated because it wasn't worth the effort.


Quality machine translation for less common languages would be a service worth paying for. Any links to examples and evaluation of ChatGPT capabilities?



It's a hype bubble. There's a small group of die-hards in the tech-adjacent VC community that talk it up to no end and attack anyone who disagrees (I just saw a guy ask you if you were "virtue signaling" with your post - lol). And there's lots of people exploring how it could potentially be commercially useful, just like with blockchain. As everyone points out, it's got more going for it than blockchain did, because it does something more tangible.

But the jury's still out on the long-run commercial potential of pretty good autocomplete or chat-as-search. It's probably more than zero (like blockchain), but it won't "change everything".

I'd guess that 90% of people either agree with your observation or don't notice / care. It's just that these things bring out the vocal defenders, usually a sign that people know deep down it's a bit of BS and post out of insecurity.


I've got access to GPT-4 at work. There have been many times that I'll encounter a bug while programming, I'll paste my code to GPT, tell it what my code is currently doing, and tell it what I actually want the code to do. It then explains what I'm doing wrong and what I can do to fix it. I've had success with this over 90% of the time. This saves me a significant amount of time that would have otherwise been spent hunting down solutions on Google, Stack Overflow, GitHub issues, etc.

I don't know what else to say other than that I would not willingly go back to life without GPT. The value speaks for itself to me.


How simple are your bugs? The bugs I usually have to fix at work involve edge cases around complex user interactions; they are less programming bugs and more "well, we didn't really think about this particular user interaction when developing this feature" - things that are usually simple one-line fixes, but can take hours to figure out, between the back and forth with product over whether it's even a bug and tracing where exactly the data/interaction in the code comes from.

Until I can paste in my entire codebase and the entire history of the product development process into GPT I don't see how it can help.

The bugs that it easily fixes are generally the bugs whose errors I can copy/paste into Google and find an immediate answer for on Stack Overflow.


How is this possible? What do you guys do at work? I haven't had success with either GPT-4 (did you build your own API calling tool for it? Do you just paste it into their Playground?) or GitHub Copilot in delivering anywhere close to 90% of the time. It usually misses a whole lot of context.

It feels like it would work for perfectly encapsulated, small, single-purpose functions, which of course sounds great, but in reality not many projects are structured like this.


I'm doing frontend work (React/TypeScript), so that probably helps since I am working with relatively small components.


I've used it to try to generate some rather small components in React / TypeScript myself, and what it did to arrays of refs, with hook calls inside a useState hook's initialization function, plus the fact that I couldn't get it to fix its issues by doing what people suggest ("just copy-paste the error") or by trying to reason with it, made me not trust it so much. The output code is also pretty low quality, in my experience and opinion.


I think you're wrong. I never bought into the blockchain hype, and blockchain added literally nothing to my life other than some speculation right at the beginning. I would pay $200 a month for ChatGPT, today, without thinking twice about it. That alone makes me think it's a much, much bigger deal than you think it is.


Why is this downvoted? It is a good point. My flatmate has a PhD in comp sci, does AI/ML research. He is now paying for ChatGPT "pro" because he is non-native in English and needs a bit of help to improve when writing his papers. He said ChatGPT is so good that he is learning a lot about English by just reading the corrections/improvements. And yes, we have talked/agreed about the scam/sham of blockchain/defi.


> it does something more tangible

It does something more superficially apparent to naive people. Decentralization is far more important; it will be one of the main ingredients of direct democracy.


I thought about learning Go and Fiber to build the backend of a side project of mine, and I did, but as with any new language / stack, I wasn't feeling confident in it. Then ChatGPT came out and I thought, what the hell, let's see what all the fuss is about.

So I asked it to write me a struct for a table with the "id, name, longitude, latitude, news" columns. That worked well, I was surprised it automatically inferred the data types for said columns.

Then I asked it to write endpoints for retrieving a record from that table and it did so perfectly which again I was surprised by. Asked it to add endpoints for adding records and retrieving all records. Again, no bugs, perfect code. At the end I asked it to create a python script to test the API and it did so flawlessly.

Next day I created a Docker env with Postgres and went to test the code, but it didn't work; turns out it had written it with MySQL in mind. So I went back and told it to rewrite the entire code with Postgres in mind, and again it did so flawlessly. Overall, writing this small API took maybe 30-60 minutes.

Considering I was a total newbie at Go, this probably would have taken me several hours to complete successfully, and this code is basically just boilerplate. I don't care to learn it by heart so I can be more productive in the future; now that I have ChatGPT, I basically don't have to. I don't have to write Python to speed up my dev time - I can just have ChatGPT write the basic stuff in a highly performant language. It removed the only drawback, which was more boilerplate.
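For anyone curious what that kind of generated test script looks like, here is a rough Python sketch in the same spirit. The endpoint paths and column names are guesses based on the description above, and a tiny stub server stands in for the actual Go/Fiber backend so the script is self-contained:

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Fake data shaped like the "id, name, longitude, latitude, news" table.
RECORDS = [{"id": 1, "name": "Berlin", "longitude": 13.405,
            "latitude": 52.52, "news": "none"}]

class StubHandler(BaseHTTPRequestHandler):
    """Stands in for the real backend: /records returns the full list,
    any other path returns the first record."""
    def do_GET(self):
        payload = RECORDS if self.path == "/records" else RECORDS[0]
        body = json.dumps(payload).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # keep test output clean
        pass

def get_json(url):
    with urllib.request.urlopen(url) as resp:
        return json.loads(resp.read())

# Start the stub on an ephemeral port, then exercise the endpoints.
server = HTTPServer(("127.0.0.1", 0), StubHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
base = f"http://127.0.0.1:{server.server_port}"

all_records = get_json(f"{base}/records")
assert isinstance(all_records, list) and all_records[0]["name"] == "Berlin"

one = get_json(f"{base}/records/1")
assert one["id"] == 1

server.shutdown()
print("all endpoint checks passed")
```

Against a real backend you would drop the stub server and point `base` at the running service.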


> Again, no bugs, perfect code

How do you know? Either you know enough Go to be able to tell, and thus you'd be able to write this yourself, or you don't and thus you can't really judge.

I mean, it probably is right, but this "I don't know something, but I trust what the chatbot told me" is what worries me about the rise of the LLMs.


Reading and validating code is way easier than figuring out how to write it in the first place. A lot of programming is the same - functions, conditionals, loops, libraries, etc.. The hard part is mostly semantics across languages. Way easier for GPT to do the first pass, then go through it and read up on the parts you don't understand.


It isn't in my experience. Understanding code that you did not write for a problem that you do not fully grasp is a lot of work. It's hard enough when you did write the code and when you do fully grasp the problem, which you'd have to if you were to write it yourself. Typically that's what first draft code is for: to see if you actually understand the problem.


Do you find it takes more time to review a PR than it took for someone to figure out the solution and implement it?


Understanding the code of a fully fledged application is hard.

But with ChatGPT you can build it piece by piece.


With a brain you can also build it piece by piece, in fact I don't know of any other way of writing a large software system than doing it piece by piece.


Sure, but the argument was that reading an entire application's code is hard, therefore GPT-4 is counterproductive.


Validating code you understand, sure. Validating code for a language you don't know, I don't see how you could.

E.g. say you speak python, how would you know how to cleanup memory in C, since you never had to do it? Would you even know that you have to?


> Reading and validating code is way easier than figuring out how to write it in the first place

Not at all. I would even say it is harder because the code being reviewed is priming you.

All code is a leaky abstraction and if you don't know what to look for you just won't see it.


That's just not true. I expect software to become a magnitude shittier soon because of this attitude. After that I expect software to become unbelievably good, because thanks to AI we will have the ability to prove correctness of software much more cheaply than before, and to design on a higher level than before. After all, you don't want the AI to help you generate boilerplate code, you want the AI to help you avoid boilerplate code.


It is true if you can understand the code it writes. Maybe you are worried junior programmers are gonna churn out code using ChatGPT but to be honest I'd rather trust code from ChatGPT than from a junior, I feel ChatGPT writes better code on average.


I'd argue you only THINK you understand the code. If it generates a 1000 lines, which look like they are doing the right thing, will you be diligent enough to go through every single one of them? This literally can only work for extremely boilerplate code (which is maybe the code most of programmers write), but for most code I write I need a mental model of it, and constructing that model myself is easier than to try to learn it from some code. Of course ChatGPT can work as an inspiration, especially for working with unfamiliar APIs.


Because what the code does is simple enough, connects to a database, runs some queries and returns the results. I know it worked because I tested it and for that kind of code there aren't really edge cases. Even if I can't validate or catch syntax errors because I'm not yet experienced enough I can see the overall structure of the code and what it does and if it runs then there's no syntax errors.

It's like with riddles: someone asks you a riddle, you think and you think and you draw a blank, but if you are given the answer you can instantly validate it even if you didn't know it beforehand. Same with this.


On one hand, one may not know the exact syntax for a function call but still understand, looking at code, where a function is defined and where it's called. Those are two different knowledge sets: the first doesn't transfer from other languages, but the second does.

On the other hand, you can ask GPT to write tests and validate at the outer layer that data is transformed the way you need.


> I mean, it probably is right, but this "I don't know something, but I trust what the chatbot told me" is what worries me about the rise of the LLMs.

I share the same worry with regards to the way humans will use AI, along with worries about enabling various antisocial behaviours at scale.


Stop looking for the “killer app”; that metaphor is not useful here. Language models don’t need to have one, two, or a hundred “killer apps”. They are likely going to be highly distributed. They can revolutionize every textual interaction point with any person, group, or organization. For worse and better.

Instead, pay attention to how industries and people redefine themselves. There are going to be winners and losers.


I found ChatGPT helpful in a bunch of diverse situations. I'll go into detail below, but overall I think the most valuable thing it can do is understand questions that take both knowledge (that I don't have and want to acquire) and understanding (that prior AI technologies did not have) and provide meaningful answers. I suppose I find it very useful for my own education - with the understanding that I should use 2 or 3 grains of salt when reading its answers.

For example, I am studying Japanese and often encounter expressions that seem to mean the same thing; I would ask my teacher, but her time is limited. I can instead ask ChatGPT, and only bring to my teacher the questions whose ChatGPT answers did not convince me.

Another example: I like understanding why there are certain steps in a recipe. It would be hard to find someone with the knowledge and time to answer the question, never mind I should pay for their time. ChatGPT can explain what I want to know at the level of detail that I want.

I was also able to get a decent understanding of a mathematical question I had no business understanding by recursively asking questions until I was able to link its answers to my own knowledge.

It was also able to answer questions about the Spring framework that I had while reading the documentation itself. In that context, going in rabbit holes severely slows down learning and has the potential to just get me lost.


Be wary of using ChatGPT with Japanese. As someone fluent enough, and using ChatGPT for inspiration in writing, as well as to find words that either I have "on the tip of my tongue" but fail to remember, or idiomatic expressions that I may or may not have heard of: ChatGPT can come up with weird hallucinations (Edit: to the point of making up words, I guess mainly because its tokens are at best at the character level; knowing that, it's amazing it performs as well as it does, honestly).

I always double check with Google searches, but at least ChatGPT gets me somewhere where I can actually search for something useful.

Relatedly, even with an initial prompt I've used in the past to make it do what I need to a text in Japanese, and despite everything I type being in Japanese, it sometimes decides to respond entirely in English.

It can also be really bad with context. In Japanese the subject is often omitted because it's known from context, so ChatGPT often mixes things up when rewording or summarizing.


Are you using GPT4 or 3?


GPT-3. I assumed that's what most people mean by ChatGPT. Usually, people specify 3.5 or 4.

Is GPT-4 significantly better with Japanese?


GPT-4 is dramatically better at everything except speed. The minutiae most people in this thread are complaining about are almost completely solved by GPT-4.

For example I used it over like 30 minutes to conceptualize, solve, and write some code that draws a graphic for a simple physics problem (0-shot) that I could intuitively understand but had no (physics and math knowledge) tools to calculate properly and it was a great experience.

It let me pick and change what properties I wanted represented on the graph, knew what center of mass means and how to calculate it for a weird object, got something that feels correct to me, and drew it out with various representations where I used color, size, and shape to represent the various distances, weights, clusters, etc.

My non programmer friends have used generative networks to make designs for prints, incredible and generally accurate folk art, full mobile games.

People are sleeping on this. Don't be them.


Specific examples don't generalize though. My experience with ChatGPT (GPT-3) is that it is vastly better at dealing with English than Japanese. That is still probably true with GPT-4, but that doesn't mean there hasn't been progress in GPT-4 with Japanese, which a sibling comment says is the case. But it's not because it does better at $task that you can extrapolate to a different one. People have reported GPT-3.5 being better at some things, after all.


I cannot speak for Japanese, but I'd say that GPT-4 is better at Russian.

That the model is better in English is no surprise given that most of its training corpus is in English. In fact, based on the sentence structure of the output when it speaks in Russian, it's clear that what's happening there is some kind of real-time translation from English.

That aside, I have yet to see any task on which GPT-4 wasn't at least as good as, or better than, GPT-3.5. I'd love to experiment with that. Do you recall any specific examples?


Yeah that's fair, it may be that Japanese ended up getting pruned out. I sincerely doubt that though, it seems very, very unlikely.


It's night and day in Japanese.


I guess I should give it a spin then.


GPT-4 still unfortunately makes up words. Which is not totally surprising considering its tokenizer.


ChatGPT is 3.5, and the one mostly used is 3.5-turbo, that's the only free one and it's the default if you pay (though you can pick the legacy 3.5, or 4).


For a lot of your examples, you can just do a Google search for the answer - and at least Google won't hallucinate; worst case, the websites are wrong, but GPT's training data can be wrong too.


You are not missing anything. It's a bubble. This is tech 101 and all over again. You gotta hype some new thing whether it's Web 2.0, Crypto or AI.

LLMs have proven to be great as a gimmick, or at making rather okay localized approximations. You can create a good enough image without any design skills, or have auto-completion on steroids. However, there is no proof that this same tech can extrapolate to the next level.

Most startups are putting their eggs in this single basket. I have the feeling that this will be what triggers another AI winter, and some VCs will be left holding the bags... but why do I care?


You couldn’t be further from the truth.

This is the single greatest leap in productivity we’ve had in the last 100 years.

Last week, I used GPT-4 to write my code. Later, I used it to analyze 100+ websites and come up with a personalized pitch for a relevant plugin/product idea - something that would have taken me 100+ hours.


If it makes you feel better (or worse?): I am using it daily and extensively. It is a productivity boost, but not something that will propel me to god-level, and the more I use it (Codex in particular) the more I discover its limitations; that's where my conclusion has come from. It is too limited beyond its local scope, and it is not clear (and there are no indications) that it can make the jump beyond that scope.


It seems then that its usefulness depends on the intersection of what you want/need to do, what the model is capable of doing, and what you're capable of getting the model to do.

In some cases the impact might be enormous, and in others perhaps less so. One thing is for sure, the models are getting more capable, and along with that people are investing time/effort/money into improving their capability to leverage what the models can do.


Are you using GPT-3.5 or GPT-4?


How did you provide the info for those 100 websites to GPT?


How did you make it analyse 100+ websites? Scraping html and pasting it to gpt, or some other way?

I'm doing similar things, and wondering how other people handle it.


How did you use it to analyse websites?

Did it go scrape stuff for you or did already have the data or did you paste in the website data?


It has plugins now. It can search the web. It's connected to Wolfram. Zapier. It can do things.

Even before that you could hack something together. I told it how to request an image be included in responses - it includes [fluffy unicorn], I parse that out, feed it to another GPT to get a better description, then feed that to DALL-E to get the image to include
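The parsing step there is trivial to hack together. As a sketch (the square-bracket convention is that commenter's ad-hoc one, not any official API), in Python:

```python
import re

def extract_image_requests(text: str) -> list[str]:
    """Pull [bracketed image descriptions] out of a model response,
    so each one can be forwarded to an image generator."""
    return re.findall(r"\[([^\[\]]+)\]", text)

reply = "Sure! Here is your story. [fluffy unicorn] The end."
print(extract_image_requests(reply))  # ['fluffy unicorn']
```

Each extracted description would then be handed to the image model, and the generated image swapped in where the placeholder was.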


Yes, but it hallucinates sometimes. /s


I mean, that’s exactly what it’s being used for — hallucination.


Can you elaborate on this? Or are you just being cheeky?


When you use it for brainstorming, hallucination is not that big of a deal - you need to verify ideas anyway. And it's great at "thinking" outside of the box, if you prompt it right.


Ironically you've proven you have no value add over GPT-4 by choosing to use it to complete your tasks.


That seems like it supports his point pretty well. If he can use it to basically replace most of his work, then it doesn't seem like a bubble at all


What it does is it makes you multidisciplinary. You might be a coder who struggles with writing social media posts and blog headlines. GPT fills that shortcoming.

It’s also a massive skill multiplier. It can turn someone with a 3-6 months knowledge of a discipline equivalent to someone with 2-3 years (or even more) in the field.


1000x this.


I believe that you are half right. AI is going to radically change the world, but not now, maybe in 10 years. The pace of research is slower than the expectations of investors. This will lead to the perception of a boom and bust and AI winter, but in reality there will be slow steady improvements. It’s just been commercialized too soon.


There is also the other (very likely, in my opinion) possibility that LLMs will hit a wall in terms of performance and a new model will be required. It's unfortunate that the current VC landscape encourages this single-shot mentality instead of diversification. This will not help in the AI winter, since most people will be invested in the same thing.


Wouldn't it stand to reason that LLMs should be obsolete in about a year, due to the singularity event and the rapid speed at which we're moving towards AGI, and then on to exponential technology growth due to ASI?

At this point, as an AI researcher, I guess you'd just have to sit back and watch it all unfold; very soon everything you do will be obsolete almost immediately.


This suggests that LLMs are the path to take us to AGI. There is no proof of that. It seems to be the current bet, but my thinking (and it is strictly mine) is that we are still technologically under-powered to achieve such a leap. Maybe in 10 years, or in one year, or in a hundred years. However, for that leap (and this is strictly my opinion), we need significant infrastructure leaps. Something like processors being 100x faster, or your laptop being powered by a 5GHz, 200-core CPU...

As it stands, we can't even get the Mark Metaverse right. You are trying to convince me that we have the infra. for AGI? Not convinced.


Three observations here. Firstly it has been a really eye opening experience watching the innovation around Stable Diffusion and locally run LLMs and seeing that the unoptimized research code that needed such beefy hardware could actually be optimized to run on consumer hardware given sufficient motivation.

Secondly it wasn't obvious that deep learning was going to work as well as it did if you simply threw enough compute at it. Now that this tech has reached critical mass there is a tonne more money being poured into infra to support it.

Lastly, compute power is increasing as always. Nvidia releasing H100 and also their recent work on computational lithography. Also DeepMind finding new state-of-the-art algorithms for doing matrix multiplication with AlphaTensor. You can kinda already see the positive feedback loop in action.

I dunno... at this point I just wouldn't bet against the trajectory that we're on.


What actually is the trajectory we're on and what will we do once there?


So, it seems the trajectory is one of increasing generality and capability of models and increasing reliance on them.

If it's at all possible to improve our technology then we will. If we improve it it increases in utility. If it increases in utility we use it more.

What other thesis is there?


The model architecture has stayed roughly the same since the original AIAYN transformer in 2017. That’s 6 years of nothing fundamental happening.

Now, obviously the models have got hugely better in capabilities since BERT. Everything else has advanced. Tweaking, tuning and scaling have delivered true intelligence, albeit sub-human. But it seems unlikely that transformers are what take us to human-parity AGI and beyond, because the more we optimize these word predictors the more we find their limitations.

The lack of architecture changes over the last 6 years creates a huge amount of “potential energy”. A new model architecture might well push us over the human-parity threshold. It wouldn’t surprise me if I wake up one day to find that transformers are obsolete and Google has trained a human-parity AGI with a new arch.

This could happen tomorrow or in 20 years. Transformers had an easy discovery path, from RNNs, to RNNs with attention mechanisms, to Transformers. Architecture X seems to have a much more obscure discovery path.


It's certainly going to be very interesting what comes out of the training runs that are going to be done on giant clusters of H100s.


I also believe we might see something unfold but we still don't know.

It would be wrong, though, not to keep a very close eye on it, or indeed to embrace it, because even if it doesn't happen in the next 20 years, you still need to earn money, and with expertise in ML you might be better off.


All current indications say otherwise.

Web3 and crypto didn't disrupt markets.

ChatGPT - or let's say ML - already did, and there is no writing on the wall that it will stop. On the contrary, it shows how much potential there is and how excited a lot of people are.

We already have ML in Office, in Bing, in Google Workspace. There's Midjourney, SD, etc.

You can find ML-generated porn pictures.

We constantly have news about advances in multiple ML fields.

This is so different to crypto and stuff.


- Amazing translations (this alone is a game changer)

- Language learning... The ability to improvise realistic conversations is huge. I can ask to talk about cooking a specific dish or a sport!

- As others have noted, refining documents similar to grammarly

- Looking for a product with extremely specific features (though it isn't very good at comparing yet)

- Searches that are too vague and complicated to articulate to a search engine or use exact matching ...


I don’t think people dismissing AI have access to GPT-4 yet. GPT-3.5 did feel like a gimmick.

GPT-4 feels like a team of really capable human interns.

The potential really goes wild once you connect it to the internet and use stuff like autoGPT


This makes a ton of sense.

I've been struggling to reconcile my personal experience with what I'm reading - it was so strange reading such dismissive comments by such a knowledgeable community about a new technology that's such an obvious game changer.

GPT-3.5 was easy to dismiss, but GPT-4 is incredible.


Didn't we have amazing translations for a long time?

I do not notice a significant improvement between new models and last year's deepl/Google translate.


Sounds like you're a native English speaker? Translations to English often worked "well enough" with Google. But as soon as you tried the other way around, or from one non-English language to another, the results were often fully incomprehensible. Btw, the same goes for t2s and s2t. It only got slightly usable in the last few years, and therefore wasn't really adopted in many non-English countries. As you can see, there is a huge market opening up. Then take voice synthesis into the picture. I think the sheer amount of recent changes, including LLMs, will steamroll a totally new media environment, and with it substantial social and economic changes.

For me it's less about the capabilities of LLMs, and more about the speed and the inevitability of change. You can choose to ignore it, but then you will soon be outdated. Just like someone trying to work an office job without using digital machines: maybe you can still do it, but who would hire you?


No, I'm an Italian native speaker mostly translating to Hungarian :). It's less good than English, but it seems to me it works well enough with both.

First time I tried chatgpt with Hungarian it actually refused to work, so I'm not sure it can be that much better.


Language pairing is important. Machine translation between English and German/Dutch has been excellent for years. English and Korean/Japanese: awful for years. Only in the last couple of years did it get much better. I am sure ChatGPT 12 will have virtually native-level translation. Maybe it will be integrated into Kindle, so that you can buy books in 25 different languages, then get a ChatGPT-translated version with one button click.


DeepL is really good, better than Google Translate. ChatGPT is about the same, but it can also explain the translation and the grammar. It might sometimes be wrong, but it's usually good enough at explaining things to point you in the right direction.


For translations, I use Google translate. Should I switch?


Yo, check this out! DeepL is like way better than Google Translate, so you should totally use it. But, GPT-4 is like a game-changer, dude! It's obvi way better than DeepL, and it can do all sorts of stuff like rewording, explaining in detail, and changing up the style. This text was translated from Japanese to gamer-style English by ChatGPT-4, no joke!


This passes for “gamer-style” English?


For something machine translated from Japanese? Yes, it's astonishing, actually.


I had to read GP twice, plus your comment, to realise it might have been generated by a GPT-ish service. Damn, this stuff is getting good.


LLMs are better at context - as far as I can tell Google Translate only does one sentence at a time.

With GPT-3 it infers things like gender and formality from earlier context.

Not clear yet how good GPT-4 is as OpenAI won’t say what training data it has, even rough volumes, in each human language.

Needs some thorough research testing it.


DeepL [1] usually does better in my experience, but I suspect Google Translate has gotten better since last time I used it.

[1] www.deepl.com


I’ll give you a concrete example from a few days ago in my job.

I needed a quick utility window in the Unity editor to see what animations could fire animation events and what those were.

I’m somewhat familiar with the editor API, enough to know what to Google and roughly where to go in the docs. I don’t do it enough though to really learn it beyond that point. So I’d estimate I could spend maybe one and a half hours, counting research, coding something, testing it and then context-switching back to what I was working on.

On a whim I asked ChatGPT (GPT-4) if it could do it for me. Formulating the prompt took a few minutes. I included a short bullet list of what I wanted and told it what Unity version I was on.

In almost an instant, it did it. I copied the code into a new file, added it to my project and it worked.

Time from idea to the first working version was around 10 minutes.

I asked for some minor refinement and then asked how I could extend it. It gave me starting points and taught me something new about Unity. All that slow doc searching, Google searching and forum-trawling was gone.

It’s like having my own personal dev assistant.


One worry I have for the long term is: how will we learn or adopt new programming languages in the future, when LLM du-jour knows nothing about Language X? Will we be stuck with what we’ve got because LLMs make us too productive in them?


I'm very worried that machine learning will lead to stagnation in general, in programming and in art. People will accept the machine output and call it a day instead of making something new.


I thought about this as well.

As long as the new language is well-documented, it should be simple to teach the LLM to use it.


But for a new language, there isn't a large body of work to train the LLM on what idiomatic code looks like. It really worries me that we're going to be stuck at this local maximum.


> I’ve seen examples of “write an email to my boss”. It would take me longer to explain to ChatGPT what I want than to write these myself.

You can integrate ChatGPT here to help with the proofreading and editing. If you have a list of points, you can have ChatGPT write an email, then integrate its changes. This is useful especially if English is not your first language. (Edit: here's a quick example. These emails aren't great, but might be better than what I can come up with myself in 2 minutes. https://pastebin.com/dD22gR4y)

> I’ve seen “write a snippet of code” demos. But I hardly care about this compared to designing a good API; or designing software that is testable, extensible, maintainable, and follows reasonable design principles.

It's super helpful when starting in a new space. I needed to write a python data munging script the other day. Using a few ChatGPT queries I found dependencies and understood the basics of using them. I still had to check the docs, but I jumped past the "tutorial" and straight to "API reference".

> In fact, no one in my extended sphere of friends and family has asked me anything about chatGPT, midjourney, or any of these other models. The only people I hear about these models from are other tech people.

Counter-anecdote, I met two copywriters working for media publications (non-tech) who both have had GPT-based services integrated into their workflows by the company management.


>I’ve seen “write a snippet of code” demos.

I've used this part of ChatGPT before. Incredibly useful for getting the syntax of some library that you're going to use once in your life, then never again.

Had a sysadmin mate do something similar to generate a simple Chrome plugin for internal use at his work.

That alone justifies the price for ChatGPT Plus, IMO.


Agreed. It’s also great for things like “zip up all the files in this folder, upload the zip file to s3, and output a progress bar while uploading”. It’s not difficult in an architectural or algorithmic sense, but it takes time to refresh on the APIs, find a progress bar library, and how do presigned s3 posts work again? And you can make little mistakes, forget error handling, etc. You could easily spend an hour or more on something like this, especially if you haven’t done it before in x language.

With ChatGPT it takes a few minutes. It can add up quite dramatically when you have a bunch of these kinds of tasks on the todo list. It does feel revolutionary to me as a productivity enhancer.
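The zip-with-progress half of that task is a good example of the kind of thing it churns out. A stdlib-only Python sketch (the S3 upload is only noted in a comment, since it would need boto3 and credentials):

```python
import os
import zipfile

def zip_folder(folder: str, zip_path: str) -> None:
    """Zip every file under `folder`, printing a simple progress bar."""
    files = []
    for root, _dirs, names in os.walk(folder):
        files.extend(os.path.join(root, n) for n in names)
    total = len(files) or 1  # avoid division by zero on an empty folder
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for i, path in enumerate(files, 1):
            # store paths relative to the folder root inside the archive
            zf.write(path, os.path.relpath(path, folder))
            done = int(20 * i / total)
            print(f"\r[{'#' * done}{'.' * (20 - done)}] {i}/{len(files)}",
                  end="")
    print()
    # The upload would follow here, e.g. boto3's s3.upload_file(zip_path,
    # bucket, key, Callback=...) for upload progress; omitted so this
    # sketch stays dependency-free.
```

It's exactly the refresh-on-the-APIs busywork described above: nothing hard, just time you'd otherwise spend in docs.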


I used it to knock up a browser extension tonight. My knowledge of JS is basically non-existent. It definitely paid for itself this month.


It's good for so many things:

Need to write a speech as a best man

Write a eulogy for a deceased family member

Recently used it to speed up fixing an OAuth problem in code I didn't write, knowing nothing about OAuth. Replaces Stack Exchange with much more tailored answers.

can take hundreds of pages of text and distill it down to an executive summary of any length

Daughter uses it to explain how to solve algebra problems, not just give an answer. Will completely change education.

Marketing using it for all content. Devs hate writing blog posts, it will take some code as an input and write a blog post about what it does. Can use it to come up with questions to seed a podcast

Paralegals are virtually redundant. AI with all legal caselaw making it much easier

HR using it to write customized offer letters, review resumes, etc.

It isn't just about asking a question and getting an answer. You can keep adding context, and the answers keep getting better.


> Need to write a speech as a best man

> Write a eulogy for a deceased family member

Yeah, if you're okay with sounding like marketing copy written by a robot.

Remember last month when Stable Diffusion was supposed to put Disney out of business? Not gonna happen.


Amen.

It outrages me that someone would waste my time making me listen to a regurgitated, averaged-out speech, written without any emotional fire or depth, at an occasion that is profoundly meaningful.

A speech as a best man should be personal, heartfelt, it should be funny and a little cringy maybe, and it should mean something.

Same with eulogies; this is a moment to celebrate someone's life, to state what that person meant to you, how they affected you, to share something about that person with everyone else who is mourning.

It's like those services that will buy gifts on your behalf for your "loved ones" or at least the "acquaintances" that you seem obliged to provide gifts for. You've outsourced your taste, your chance to buy something quirky, meaningful, useful based on how well you know a person... which shows that you just don't know them at all.

Better to say nothing than say it with ChatGPT.


That's quite arrogant.

Not everyone has written something like this before.

Why is it unfair or impersonal if ChatGPT helps you, guides you, and gives you a good starting point?

Plenty of people would Google some example and start from that, so what's so wrong with letting ChatGPT generate a shitty first draft already tailored to your situation?


Hello greeny7373, nice to meet you.

> Why is it unfair or unpersonal if chatgpt helps you, guides you and gives you a good starting point?

As somewhat extreme examples: Rodin didn't swoop in and smooth off "The Kiss" after an apprentice chiselled the basic outline out of a lump of marble; I'm pretty sure Salvador Dali started with a blank canvas instead of getting a basic landscape from a $2 Art Shop and adding a melting clock and a giraffe to it.

We all have language.

Take your age, subtract maybe 5 years, and that's how much experience you have expressing yourself in your language.

Say you're 20; by that metric you have 15 years' experience communicating. Now, I don't play guitar and I'm not remotely musical, but I'm pretty sure if I did it every day for 15 years I could at least bash out something original, if possibly influenced by things I liked.

I'm not suggesting that people entertain us for half an hour with a heartfelt, witty speech about their relationship with the groom; I'm not suggesting a ten-minute poetic ode to a life well-lived that leaves everyone simultaneously trying not to cry and trying not to laugh, nodding and saying "yes, that's how they were".

It doesn't need to be long: just one memory or incident. It doesn't need to be Shakespeare: just heartfelt. It doesn't need to win the Academy Award for Best Actor. It doesn't need to be a five-paragraph essay (though if you did use that model, there's nothing wrong with it).

It just needs to be you; it just needs to be yours.

(I already know what I'll say at my Father's funeral when the time comes: it'll be about two sentences, just something he said to me once. But I know that's all I need to say).

Maybe I am arrogant, but I want human feeling and human expression at human events. I don't want to be snoring through regurgitated pap.


I started to see a huge issue with knowledge: people don't know what they don't know.

If you have a magic machine, your solution is no longer "no clue how to do this"; it becomes "I will ask my trusty expert machine".

This machine gives you the next step, the guidance you need, without being judgmental. Or, much simpler: unlike a colleague who looks too busy, it never makes you feel you're disturbing someone.


Some people are just bad at this kind of thing, and sometimes saying nothing is not an option if there is a social expectation that you say something. Which can create a lot of anxiety.


Yep. The best man's speech at my brother's wedding was a disaster zone in every possible way; it was a sad blemish on an otherwise great day. It'd have been far better if he'd had some AI help along the way.


I don't care how bad it is, I just want something from your heart.

And I'm sure you can still make something heartfelt with the help of a chatbot, but I'm afraid that people won't.


Many people won't with or without a chatbot. A chatbot might at least make it less cringe.


No one said Stable Diffusion would put Disney out of business, but nice straw man.


Really?

I'm too lazy to search for the Hacker News threads, but trust me, they did.

Though I agree, that was two months ago and we are so over that already.


> can take hundreds of pages of text and distill it down to an executive summary of any length

This is the kind of breathless claim that no doubt fuels the skeptics.

None of the context windows are large enough for "hundreds of pages" nor "executive summaries of any length".

I do believe it's possible to make LLMs do that kind of task, with significant engineering effort to make them summarize iteratively, essentially "compressing" parts of the document recursively. But it's not something you can just give to ChatGPT and have it work.
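The iterative approach described here is essentially map-reduce over chunks; a runnable skeleton, with the actual model call stubbed out as a hypothetical `llm_summarize` function that just truncates so the control flow can be exercised:

```python
def llm_summarize(text: str, max_chars: int) -> str:
    # Stub for the real model call; a real version would hit an LLM API
    # with a "summarize this in at most N characters" prompt.
    return text[:max_chars]

def summarize(text: str, chunk_size: int = 3000, target: int = 500) -> str:
    """Recursively compress `text` until one chunk fits the context window."""
    if len(text) <= chunk_size:
        return llm_summarize(text, target)
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    partials = [llm_summarize(c, target) for c in chunks]
    # Summarize the summaries, until the whole thing fits in one pass.
    return summarize("\n".join(partials), chunk_size, target)
```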

So yes, the hype is real, in both ways: there's lots of potential to explore over the next years, but also a lot of the claims you read today are not sufficiently hedged, which makes them look outlandish.


There are plenty of tools that make it possible, such as langchain and the rest


Sure, quite likely - but only with significant engineering effort. I said as much in my comment.

The point is that it's not a turnkey solution today, but some of the comments on here make it sound like it is. That's a form of hype.


You describe it in a way that sounds like hype is negative.

Hype can also just mean a lot of media attention.

This doesn't indicate anything good or bad.


Unrelated to the general question of usefulness, but I'd be careful using it as a mathematics tutor. Maybe the algebra in this case is sufficiently basic as to be trivial for ChatGPT, but I've found that quite often it gets confused by relatively simple mathematics in very baffling ways (one example I found amusing was it trying to use the pigeonhole principle with exactly N pigeons and N pigeonholes).

There's also the problem that, being a complete pushover, it makes it hard (especially for a learner) to pinpoint whether an error is in the source or in the reader's understanding. A teacher may, when a student asks about what seems like a mistake, say "No, this may seem wrong but it's correct for so-and-so reasons", but ChatGPT usually goes "You're right, that was a mistake" and gets itself into messier and messier reasoning.


I don’t think you’re missing anything. I like the “calculator for words” analogy that was posted on HN a few days ago. It doesn’t seem like a revolutionary product, but it does seem like a fundamental innovation which will then unlock many more complex things in subtle ways. Calculators were arguably the predecessor to computers, which are kind of a big deal :)


> Calculators were arguably the predecessor to computers, which are kind of a big deal :)

"A computer is a machine that moves data around and only occasionally performs computation on it." ;-)

Computers aren't great, because they can compute numbers faster, they're great because we've managed to encode text, audio, video, geospatial, etc. information as numbers, which allowed us to perform complex text, audio, video, geospatial, etc. operations.



Most of the people asking the question "to do _what_, exactly" seem not to have much "knowledge worker" experience. Every non-tech person I showed GPT-4 to was, after a few tests, immediately using it to speed up parts of their daily work.

Most impactful is how it destroys the blank page/getting started barrier, second how easy it is to substantially change/adapt/refocus produced work.

It is like having an incredibly efficient, patient and encyclopedic junior collaborator 24/7 at your disposal. It can't be trusted to fully automate without knowledgeable supervision, but it saves a boatload of time and effort.


> Most of the people asking the question "to do _what_, exactly" seem not to have much "knowledge worker" experience.

From personal experience I'd say the opposite: GPT lacks the specialist knowledge to produce useful writing or yield accurate answers in any of the markets I've worked in (I'll grant that less niche markets exist, and that GPT is pretty good at fixing the writing of people who lack English writing skills). It seems like the people touting GPT as replacing most of those roles are all showcasing hypothetical "generate a website for an imaginary product with minimal brief" situations, which GPT excels at precisely because no real-world knowledge-worker constraints are imposed on the solution. That's definitely not to say it has no use, but plenty of less technologically impressive accomplishments, like data-entry wizards and templates, are also useful without being considered transformative.


As is already a cliche now: GPT will not replace your job. Your colleague using GPT will.


As I already suggested: in my experience, certainty that GPT will be transformational in a field of knowledge work seems to actually be inversely related to experience of that field. A response whose certainty that GPT could be transformational to the stuff I worked on exceeded only by ignorance of what any of that stuff was is quite a good demonstration of that point...

(and really, there's nothing particularly special about any of the stuff I've worked on, it's just GPT doesn't have relevant knowledge or a path to acquiring it so doesn't generate remotely adequate responses, struggles even more with novel concepts and would be terrible at real time discussion even if suitable interfaces to it existed and were unobjectionable, and that's before we get started on the privacy implications)


> Most impactful is how it destroys the blank page/getting started barrier, second how easy it is to substantially change/adapt/refocus produced work.

This is the killer feature of GPT for me. I'm very, very good at optimizing and solving problems within specific domains, but terrible at picking a direction with no boundaries. (Pick a theme for a costume party and I'll have the most interesting costume. Throw a Halloween party and I'll show up in jeans and a t-shirt.)

I recently wanted to submit a conference talk, but wasn't sure where to start. I gave ChatGPT a list of possible topics and some general guidance about what I thought was interesting and asked it to suggest topics. I picked one from its list and asked for tweaks, then "discussed" with ChatGPT for a few more rounds until I had a very clear idea of what the talk would be.

I don't feel like that's cheating. I'm still going to create and give the talk myself. But if I had to come up with the topic and abstract on my own with only a blank sheet of paper to start with, I'd never have submitted it.


It's just good enough and approachable. But I agree, for what I do professionally it just doesn't even come close and I'm already quite efficient.

From me and around me:

- marketing asked me to show them the ropes of Midjourney yesterday; the boss said "this will be the face of our new product" about one of my hasty "creations".

- mom learns English with ChatGPT because she finished Duolingo

- I wrote a PoC demo for a prompt engineering tool and a GPT-4 demo chat in a day. That would have taken me days without GPT-4. (Material Design, storing in local storage, gobbling up the data specification, saving everything on change rather than with buttons.)

- First draft for some diagrams in mermaid js worked well too, or converting from flow diagram to a swimlane diagram.

- All kind of personal data cleaning: dirty list of emails > ready to paste in mail client, wall of text with broken new line characters > sub-headlines and paragraphs

- virtual assistants (e.g. company chat-/voicebots), needs a bit of tooling but gpt4 is totally ready for it (apart from latency and price)

- 30 minutes to a browser-add-on that marks tweets as "seen" so I can skip them if I scrolled past them before. (userscript to be precise)

- understanding tax regulations %)
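The data-cleaning bullet is representative of the kind of snippet these prompts produce; a hand-written sketch for the dirty-email-list case (hypothetical input, stdlib `re` only):

```python
import re

dirty = """
Alice <alice@example.com>; bob@example.com,
"Carol" carol@example.com  alice@example.com
"""

# Pull out addresses, dedupe while preserving order, join for pasting.
emails = re.findall(r"[\w.+-]+@[\w-]+\.[\w.-]+", dirty)
clean = ", ".join(dict.fromkeys(emails))
print(clean)
```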

What I'm waiting for:

- better knowledge ingestion so it can use my notes

- personalization over time

- good dev-ops integration (push and deploy for me too).

- maybe something to have better separation of concerns on code so the messiness matters less, not sure if possible :)

- remote control my screen

- running an LLM locally

- have it build its own plugins for any website or service


I‘ve been using ChatGPT to:

- write and explain me more optimised algorithms for certain cryptographic operations

- explain funny mathy bits of papers that I don’t understand

- plan me a few days of activities for a city holiday

So far it's been great on all counts!! I was able to get a faster turnaround in understanding the papers than I would probing a colleague.


What gives you confidence its explanations are accurate?

> write and explain me more optimised algorithms for certain cryptographic operations

This domain in particular strikes me as a poor choice for this approach. "Don't roll your own crypto... but definitely don't let a language model roll it for you, either"


Well, it gives me a direction to dig in; often papers use inscrutable notation or seemingly magical variables whose origins I can't trace. Will ChatGPT always be right? Probably not, but these are things I can validate, which is better than no info at all!

Re: crypto algorithms, the query in question was implementing exponentiation for arbitrary-sized integers. My own implementation was taking until the heat death of the universe to finish for big integers, and I didn't want to just copypasta an impl from elsewhere.

ChatGPT's version worked flawlessly, and it was able to explain certain tricks it used in depth (which I could independently verify from other sources).

Would I ship it to prod? Not without a security audit, but that ought to be the case regardless when rolling your own (or even someone else‘s) cryptography :)
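For the curious: the standard trick for fast exponentiation (and presumably the core of what it produced) is square-and-multiply, which needs O(log n) multiplications instead of n. A minimal sketch:

```python
def power(base: int, exp: int) -> int:
    """Binary (square-and-multiply) exponentiation for non-negative exp."""
    result = 1
    while exp > 0:
        if exp & 1:          # current low bit set: multiply this square in
            result *= base
        base *= base         # square for the next bit
        exp >>= 1
    return result
```

For cryptographic use you'd reduce modulo n after each multiplication, which is what Python's built-in three-argument pow(base, exp, mod) already does.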


This post is a clever joke, right?

Right...?


Have you never read something in a paper and had no clue how to even start understanding it?

I've talked about snippets of papers with work colleagues. ChatGPT can do the same thing.

And why wouldn't it? It's only here to help me to find something I can use for further understanding/googling.


For a serious answer, my biggest reservation with using ChatGPT to explain concepts I know very little about is that ChatGPT often just makes up fake information, or very confidently explains things wrong.

Having to fact-check every thing it says feels a bit exhausting.

A bit like talking to a version of Albert Einstein with dementia or Alzheimer's, who can still say really intelligent things but mixes in subtle bullshit.


Well the alternative is scouring Google for answers and fact-checking them too - ChatGPT cuts out the scouring process


> I’m asking what people want the models to do every day, all the time

I know three regular people using ChatGPT, here's how they're using it:

1. Franchise consultant: uses it to research opportunities, has it write business letters to franchisees he wishes to contact. Saves him time and is a better writer than he is.

2. Immigration lawyer: uses it to summarize info and write emails to clients. Saves her a ton of time.

3. School teacher: uses it to write report card and assignment feedback. It doesn't save any time at all, but the output is more elegant than if he wrote it manually.


You are overlooking an important use case: SPAM.

Spam of all kinds and at automated-industry scale. Imagine blog-spam written in a variety of styles so you can no longer easily identify it as such. Imagine chum-boxes that no longer repeat themselves and are harder to identify as such. Imagine ads masquerading as content, as it already exists, but at scale.

And every time you slip up and click on one, it will learn a little more about you and create chum content better tailored to you.

Generative AI will facilitate a flood of algorithmic spam.

This kind of spam already exists, it just isn't scaled because it still takes time and resources to create and the people creating it are not the brightest, so for now it is easy to identify.


Natural language is now a fully functional user interface out of the box. This is bigger than the mouse.


Fully functional in what way? As far as I can tell, ChatGPT is a box that I put sentences into, and I get grammatically correct sentences that contain topics or words loosely statistically correlated with those in my sentences, which may or may not be correct and often are not. The box has little to no memory. I honestly don't see what's so useful about this box.


The results are approximately as good as asking an intern, but the response is approximately instantaneous, where an intern takes a week to do anything. And similar to the intern, you can get better results with a bit of guidance and iteration.

In short, humans kinda suck. LLMs also kinda suck, but faster than humans.


> humans kinda suck

Agreed, but that's part of my critique. These systems are written by humans and are trained with barely curated data generated by humans. There is the concept of emergence, but I'm not sure how emergence suddenly fixes a terrible foundation full of biases and errors.


> and I get grammatically correct sentences that contains topics or words loosely statistically correlated to what were in my sentences, that may or may not be correct and often are not.

It's correct enough to pass the bar exam and medical exams, and it scores in the 90-93rd percentile on the SAT. This is way more complex and efficient than you make it out to be, imo.


> It's correct enough to pass the bar exam and medical exams, and it scores in the 90-93rd percentile on the SAT.

So is Google Search, and we've had that for a long time now. Is a slightly different and more verbose UI really a game changer?

(Let's ignore the fact that Google Search is broken from all the SEO spam and monetization. Especially since we have no evidence that ChatGPT is any more resistant to this than Google was.)


Google Search scores 93% on the SAT? On questions that have no solution online?


When hooked up to plugins or tools it does feel like a fully functional interface. With access to a browser, it can do basically whatever I want it to.

I hooked GPT-4 up to a shell and asked it to use the GPT-3.5-turbo completions API (not in the dataset), and it successfully did it through trial and error with curl and error messages. This example is of course not something you would actually do regularly, but rather shows that you don't need a lot of context for it to do useful things right now. With a complete explanation of the OpenAI endpoints, it would most likely make the request perfectly on the first try.


Yes, exactly, this is what I meant. You can really leverage the API in other systems. ChatGPT on its own is cool, but my mouse comment was about NL being a bolt-on UX tool. Which is crazy!


You can tell it what you need to do in terms of data processing, and ask it to write Python code that does that. E.g. ever had to write a convoluted ImageMagick command to do something complicated? GPT will write it for you.

Think of it as a natural language interface to anything that has an API.
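As a flavor of the kind of throwaway script such a request yields, here's a hypothetical example ("zero-pad the numbers in these filenames so they sort correctly"):

```python
import re

def pad_numbers(name: str, width: int = 3) -> str:
    """img2.png -> img002.png, so files sort in numeric order."""
    return re.sub(r"\d+", lambda m: m.group().zfill(width), name)

for name in ["img2.png", "img10.png", "img114.png"]:
    print(name, "->", pad_numbers(name))
```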


You can teach it to act as support personnel.

"Only answer the following questions, and use these API endpoints to do the task."

Now you have a multilingual natural-language interface.

Have you seen the Wolfram alpha examples?


I think the problem is that people working in the field have a much different definition of what "fully-functional" means than users who believe that they're communicating with an intelligent, all-knowing or otherwise infallible being.


For my use case I use it for input processing, it’s just amazing that way. Hallucination is a huge problem, but you can avoid it.


I believe it. For text-to-text workflows it's a game changer. For anything touching a physical system, even the highest-level path planning for robotics, there's a long way to go.


It’s all about being very deliberate with your use. People are getting ahead of themselves right now for sure. It’s still a monumental leap in UX!


In the long run, it will 'secure' knowledge and data. Nobody will know what's in GPT's databases.

Currently you can command kagi/google/http websites to return information. You can infer what should be in Google's search engine and track when information is deleted.

GPT is not commanded, it predicts with inaccuracy. So anybody who wants to black-hole information behind the scenes and never reveal clues to that fact, can do so.

All failed predictions are covered by the LLM's design; you cannot infer without serious long-term study that something has been removed deliberately. You cannot infer that a valid data entry exists and you failed to retrieve it, because unverifiable BS is the default failure state of LLMs.

High level tech people will invest in this, regardless of what the public values in it. Just like Elon's SpaceX and Tesla got lifted out of pitfalls by gov and VC, so too will the AI guys.

Let me put it this way: hoard and back up every scrap of online information you care about. Hypothetically, an LLM-fueled replacement for all the websites of "the free and open web" could limit information availability.

A metaphorical example would be leaving out Tiananmen Square. Which is fine when you can just Google it, but with the old freedom of information gone, an LLM has the ability to just BS you, and you'd never have a reason to infer it existed in the first place.

It's a Super-Injunction by default, a perfect repository for spies to dump data, a librarian who will answer any question but only answer with the truth, if he likes you.

No more Snowden and Assange leaks, there's no way to chase up a deleted video with a search engine.

Anyway, you get the idea. In the long run, the structuralists are licking their lips at the thought of re-establishing a hierarchy of information access. (Probably; I don't know.)


> I’ve seen examples of “write an email to my boss”. It would take me longer to explain to ChatGPT what I want than to write these myself.

You're a better writer than me; also, English isn't my first language.

Besides that, I used ChatGPT to write a eulogy for my father. I knew roughly what I wanted to say but couldn't find a good way to say it. ChatGPT helped me there, even in my own language (Croatian), and there's just no way I could have made it as poetic as it did.


For me it’s a perfect replacement for Stack Overflow. Except every solution is tailored to my exact code and situation. I’ve even gotten it to walk me through things like installing WSL 2 without using the Microsoft Store after I nuked all Appx packages.

Maybe my favorite was pasting in pages of documentation describing all of the error codes for a library (the docs are the only source of truth) and getting it to output a very good typescript enum.


You don't need a full-blown LLM to find answers to questions on StackOverflow. You can do that today with any search engine.

> "I’ve even gotten it to walk me through things like installing WSL 2 without using the Microsoft Store after I nuked all Appx packages."

That's a search engine query you can issue already today, without involving any LLMs. It costs a fraction of what running this through an LLM does, and it actually brings you to the "source" of the information (a thread on StackOverflow with the full context, including "wrong answers", which are just as useful), unlike an LLM.


The problem with SO is asking questions though. It’s a horrible site for that purpose. I rather get a slightly wrong answer from ChatGPT than the insane experience of trying to formulate my question just right so it won’t get closed or deleted on SO.


I remember cross referencing with Google to see where ChatGPT was pulling from and it wasn’t immediately obvious. So yes I probably could have found a guide through a search engine with more digging. But clearly the LLM is the superior product.


[flagged]


Sure, you can keep throwing those hyped VC/MBA-style retorts all around, or you can acknowledge the simple reality that when it comes to the task of finding answers to questions on SO, traditional search technology outperforms LLMs on every parameter:

1. Cost: it's much easier to build and serve a traditional search index for questions/answers at scale

2. Speed

3. Quality: with search engines you don't just get one answer (you get a spectrum of answers). You have access to the "source" of those answers, which often include additional data that can help you choose the most appropriate solution to a problem.

In this specific case, the "horse" is the LLM - and the "car" is the traditional search engine with modern indexing technologies. But please, don't let simple textbook facts about technology take you off your trip.


I'm quite good at understanding the difference.

But LLM can already do things Google can't.

LLM is not just a search engine.

It's an explainer, rubber duck etc.

Instead of 'google define x' I just say ' explain x' in my dialog with the machine.

And it will only become better and better.


Probably because much of the time it's just a (hopefully correctly) curated subset of stack overflow ... tailored to your current focus.


Fine with me!


Please write a blog post about this! That is an amazing story.



Which one?


I have a friend that has been using ChatGPT to help him write technical books which collect existing niche information and present it in a way that's more accessible and organized, in Spanish. You'd be amazed by how many small niches there are without robust technical books and well organized resources. And while the information is all "out there", it's often not really easily accessible to beginners.


Text editing - if any part of your job is about writing, you can now just brain-dump everything, structure it more or less, throw it into ChatGPT, and have it produce a clear and readable article. In my case, the output is better than anything I would ever write.

Ditto all the copy-editing work in newspapers, intranets, etc. The whole field of proofreading was virtually extinguished overnight.

Marketing agencies - and I spoke to a few - increased their workers' productivity 2-4 times (sic!), virtually overnight. Anything from writing briefs to writing copy.

Programming - most of my programming work is deep algorithms, so not much help here, but for writing boilerplate code with new APIs, or writing in a language I'm a bit rusty in, ChatGPT is better than anything else.

Customer service helplines / chatbots (and the same for intranets) - we don't see it just yet, because it takes a bit more time to build a good system, but there are probably thousands of projects right now worldwide building these for every niche conceivable.

Business intelligence - we used ChatGPT, with success, in our deep-tech seed fund to help with initial project due diligence.

And, essentially, rubber-ducking, but for every single field out there. I just discussed with a psychiatrist how he can use even boilerplate GPT-4 as an additional consultant. You need to be aware of its limitations, of course, but it is already immensely useful in its current form, and dedicated solutions for medicine are coming very soon.

That's the short-term perspective and the low-hanging fruit. On top of that, you have thousands of projects now figuring out how to apply LLMs to specific niches. It was difficult before, because you had to train your own models; now you can just fine-tune the existing ones, do embeddings, or use plain prompt engineering.

Oh, and also synergy with different AI modalities - we've had a massive growth in voice recognition and generation, visual recognition, and so on. And LLMs are a glue that adds a layer of understanding underneath.


I hope you discussed data sharing confidential patient health information with third parties, openai etc.

Nevertheless offline models should alleviate some of this.


Yes, I did, of course :)


"In fact, no one in my extended sphere of friends and family has asked me anything about chatGPT, midjourney, or any of these other models. The only people I hear about these models from are other tech people."

I have the opposite experience. Everyone I meet outside tech has been exploring ChatGPT, or at least has heard of it and is extremely curious. And in non-tech student circles, a (non-STEM) TA I know said students who until a month ago were bad to mediocre (scoring 8-12/20) are suddenly all turning in top 16-18/20 assignments this month. You can argue about what to do given this change, but you cannot deny the impact.


I have the same experience. And ChatGPT was the turning point. A few months ago, to my surprise, I wasn't even able to excite my non-technical environment about DALL-E 2, not to mention other ML models that excite technical people.


I was in a Starbucks yesterday, sitting opposite two girls doing some language study, and one of them was joking that the other should ask ChatGPT about some grammar point. This idea that only tech people know about it seems way off.


We are going to have almost on-the-fly translation of video and audio. We are going to understand and write code faster. Knowledge management will change: instead of tools that partly search the web, we will get human-readable responses. A language model is about to improve language-related things for us; how killer could that be?


Code things I've used it on:

- I've thrown in JSX React components and asked it to transform them to TSX
- I've thrown in TSX components and asked it to write some tests
- I've asked it to make an offline queue so user requests are stored and sent once online again
- I've asked it to rewrite tests from Enzyme to Jest
- I've asked it to add a full-screen function to react-native-video, which is annoying

Often Im doing something else in the meantime, like coding, in a meeting or having a beer.

Finally we can be drunk and code at the same time ;).

Joking aside, it's been a huge productivity boost, and if you ask things properly will write hugely detailed and correct code.

I've also used it to understand other languages/code I was less familiar with, for instance C and SQL procedures.

The above only works with v4; 3.5 is too inaccurate. But v4 is indeed slow, so small things I can do faster myself.

With writing articles I've been disappointed so far; even corrections or style changes I ask for, it puts back in a few questions later.

However, for writing children's stories to help my kid learn how to read, it's really good, in every language. "Write a story for kids of age ... about ... use the following words." It came up with nearly perfect stories my kid loved.


I use ChatGPT at least 10 times a day, as a software developer, and I agree with all your assessments about the use-cases you mention. Here are my ten most recent prompts:

* Asking about how back-propagation works with multiple output nodes

* Examples of successful ICOs

* A deep dive into what it means to calculate the gradient of a function, with me asking lots of clarifying questions

* A deep dive into how electricity pricing works in the UK, digging into the market clearing price

* Looking for a generic "unwrap one layer of type" utility type in TypeScript

* "In JavaScript, I want to format a date as YYYY-MM-DD_HH-MM-SS"

* Seeing if there's a more concise way to get Zod to define an item that's required, but the value can be defined, other than using a union

* Ideas for naming two user fields, one that's a changed user, one that's the user making the changes

* Digging into the implications that Camilla is called the Queen Consort, not just the Queen

* "Excel I want to show the weekday as a single letter"

So, basically, it's my go-to instead of Google + Stack Overflow.


I’d be interested to see what it said about differentiating a function (and I’m assuming you mean in the context of programming). I consider that one of the dirtier secrets of gradient-based ML.


Also I've found it very useful for identifying books, TV shows, films, games, etc that I remember some details of (eg, was released around 1994, was about web design) but not all the details of


You, good sir, have a disturbing lack of imagination.

What I can ask my computer has just leaped from templated strings to raw human conversation.

I have a hard time imagining something not getting impacted.

First of all, iPhone autocomplete... (but I guess any decade-old RNN is an improvement there)


It's just a glorified google search. You'd save more time googling what you need and picking 2-3 results to read briefly.

People have no clue human intelligence has very little to do with "statistical analysis of old data".


It's actually quite a lot better than a Google search. Yesterday I gave it the output of SQL ddl and told it to write me a script in Go that reads that from a CSV, then I asked it to generate a sample CSV to verify the script.

How is that different from googling? Lots of articles won't bother to include imports, and the structs they use definitely won't match your use-case because their dataset is different. What if my use-case is a bit strange and I need to embed the file instead of reading from the file system, for example? I can ask ChatGPT and it will update the example program, using my exact file name and variable names for the problem I've described, and the program runs as written!

I don't write much Go (so I don't actually care to commit the hello_world.csv ritual to memory), but I know enough about it to verify that the program doesn't have any glaring issues and make my own tweaks as necessary. Saved so much time for me in this scenario.


It can't replace search, actually. It doesn't know everything a decent search engine can find, and might not even remember the specific php command line parameter you're looking for.

What it can do is reason through some pretty tricky interdisciplinary answers. It's about as useful as having an intern who has finished literally every white-collar college program out there but has no idea what they're doing. It takes some prompting to get useful work out of it, but it is possible, and it is very effective when you get it right.

The $20 sub has saved me quite a bit of time, the main challenge is remembering to set it to GPT4 for every damn conversation, as the previous models are trash.


I use 3.5 because 4 is too slow and hangs half the time.


Google search is a trillion dollar product.


I have read a lot of books (sociology, psychology, philosophy, history, religion, etc), and often I'll have an insight that stands in apparent contradiction with some concept I learned in one book but in alignment with a concept I've learned from another.

I use ChatGPT to understand these concepts, their background, and to bridge the gap between them.

Getting to the bottom of what Jean Baudrillard really was saying in "Simulacra and Simulation" and applying that to what Larry Wall meant when he said that Perl is the first postmodern language is really something!


Similarly, what's the point of Stack Overflow or a wiki when you can just get the answer you want from a book in the library?

What I value about these LLMs is that they are essentially a condensed version of the internet (despite being stupidly large for normal hardware currently).

Usually if I’m building a recommendation or search algorithm I have to use the data from the company I’m working with. This makes it possible for me to encode the entire internet into a model that might be running on a product that only has 100 users.


LLMs will make average and below-average writers / marketers / PR people extremely good.

You can no longer mock "Please do the needful" from $1/hr employees in India. They will be communicating at the same level as an average American, and the smart ones can completely take over large fields.

A single person can wear multiple hats and not get blocked.

This can apply at a global scale of at least 4 billion people.

Just because you lack imagination doesn't mean it isn't good


> I’ve been shown some neat pictures people made that they thought were cool. I don’t know that I need this every day.

At least they’re less obsessed with apes than last summer…


To be honest, I can't tell if you are genuinely wondering or just (virtue) signaling that you're not using it, but here's a sample of my own usage: https://news.ycombinator.com/item?id=35299121


Not virtue signaling, but I seem to be asking a different question than people are answering. I’m asking what the product opportunity is and people keep telling me examples of tasks that they use it for.

In many cases the examples are one-off and the only product opportunity is the generative model interface itself. Looking broadly over the replies what I’m seeing is “there’s a thing that used to fail the cost/benefit test, but now the cost is so low that I can automate these things”. So part of my problem is (1) the small benefits of these tasks mean the value proposition comes from volume—that probably comes from the generality of the task engine, and (2) there may be some niche product opportunities on top of the model platform, but the primary big winner here is the platform itself. (That’s not necessarily a new insight, but it seems especially true here.)

The terrifying part is how often I hear people in this thread and elsewhere mentioning tasks that are not fault tolerant to the failure modes of these models. (For example, I had a coworker tell me their relative is a doctor using ChatGPT to diagnose patients.) People keep focusing on the risks of AGI killing us all with paperclips, but I’m much more worried about getting run over by some idiot asking ChatGPT to drive their car.


> Not virtue signaling, but I seem to be asking a different question than people are answering. I’m asking what the product opportunity is and people keep telling me examples of tasks that they use it for.

Sorry, I misunderstood. "I am not using AI" has become a sort of badge of honor in certain communities so I was wondering if that's what it was.

> (2) there may be some niche product opportunities on top of the model platform, but the primary big winner here is the platform itself. (That’s not necessarily a new insight, but it seems especially true here.)

I agree with that conclusion. I think the chat interface is the killer product. I treat ChatGPT as an assistant/intern that is really good at some tasks but that can also sometimes make dumb mistakes. It has also replaced a lot of queries I would have previously done on Google or questions I might have asked somewhere (e.g. in a forum, Reddit, Discord, etc.).

Many startups build domain specific UIs on top of it using the API, but whether that will become a sustainable business model remains to be seen[0]. I am reminded of the many "vertical" search engines that were once trying to compete with Google.

[0] Saying this as someone who did something like that: https://eli5.gg


For me, it gives an inkling of a web that existed before, with less friction. Instead of going to a webpage with verbose ad-ridden fluff, I get a more or less frictionless answer, very specific to my question.

Funny enough I don’t use it a lot for programming, maybe just to jog my memory on a topic.


The thing has enabled me to take a major code project in an exciting new direction in a short amount of time. Paradoxically, I find it does better with more difficult tasks, because the prompts tend to have more detail. It can generate semantic Cypher queries for combinations of combinations of combinations of elements if you want; it's ridiculous at coding already and only going to get better. Yeah, sometimes it forgets context and you have to guide it a little, but it can still rewrite a whole module in a new style with lots of specific changes in under 2 minutes. Good luck coming close to typing that fast even if you did know exactly what to write. Coding feels pretty revolutionized overnight for me right now, tbh.


I don't have the same experience as you at all. Most of my non-tech friends or family members asked me about chatGPT. You don't find it useful to write emails maybe because you're a native speaker, but it is extremely useful for non-fluent english speakers.


I defined an interface and asked ChatGPT to implement it, in Go, as method receivers on a struct named “repo” against sqlite. The code compiled and I wrapped it in unit tests to ensure it worked. Overall saved me an hour of boilerplate, would recommend.


What is the killer app of the internet, or the smartphone? It's a silly question. It's many little things, some foreseen and some not.

One thing you're missing is that we now have a pretty good solution to any NLP pipeline that in the past you'd have to spend months to get right. You can probably still get better results by supervised training on specific tasks but it's good enough. NLP (as we knew it) is dead. This will take some time to show in the applications we use, as people figure out how to use and integrate it, and costs need to come down, but it will make it trivial to add smart functionality for things you previously needed an in-house ML team for.


Sorry, one other thing, as an actual AI researcher:

These generative models, whether NLP or vision, are cool but are really examples from a very narrow field. Most ML researchers and practitioners are working in completely different areas and would not obviously benefit from the new generative models (which are themselves productized extensions of existing tech trained on more data), so nothing's going to change day-to-day. If you were working on some alternative general-purpose generative model, maybe you got scooped. Otherwise it's business as usual.

The "revolution" is happening for tech savvy non-ML people, "tech-bros" colloquially.


I'll ask you another question: what are people searching for on Google right now that can't be answered by ChatGPT? I would say 95% of searches can be a simple question and answer with ChatGPT. I've tried it; it works.


Note: When I write "Midjourney" below, I mean any AI/ML-based art generation tools.

Art generation is going to be groundbreaking for the advertising industry. Basically, you can hire summer interns from arts academies or liberal arts schools to type cool stuff into Midjourney and generate amazing art for your ads. You don't need to pay real artists, whose original artwork, sadly, was used to train the model.

The same can be said for low-end graphic arts. Stuff like: make me a greeting card for this big event. Today, you need to pay a graphic artist to whip something up in 2-72 hours. That will be replaced by someone with modest English skills working in a call center somewhere in India or the Philippines. They might chat with you for five minutes (voice or text) to brainstorm ideas. Then they will "drive" Midjourney and put together a nicely themed party invite.

On the more advanced side, I do think artists will browse the "best of" portfolios on Midjourney (and others) to get new ideas. They might also use Midjourney to get a head start.

The next logical step beyond Midjourney is to generate the same image, but as a 3D model. I think people (myself included!) really (really, really!) underestimate the cost of creating 3D models for films, adverts, and games. If Midjourney could give you a starting model, you might save hours (or days) of work.

Next-next: Midjourney can provide basic animations of the same 3D model. Again, you can download in a wide variety of formats, so that you can import and tweak as necessary. Think: pay to play. Rendering as a GIF is cheap as hell, but downloading an advanced CAD format with 50K points in the model might cost hundreds of USD. (Remember: you are paying for a SaaS engine, not an expensive, talented artist.) Imagine: "Hey Midjourney, I need a 60-second animation of cute animals, like Animal Crossing, sitting at a table enjoying our new brand of tea called 'It's Great Tea Meet You!'." (Use ChatGPT to generate the first draft of the script.) Writing that just made me think: OK, now you can add voiceover.

The possibilities for the commercialisation of generating still and moving pictures are nearly endless, and many will be useful for the advertising, film, and gaming industries.


I think the killer app will involve adding more layers of AI to make something that is closer to an agent than a simple prompt completion engine. Maybe a more self-contained system that has an internal cycle of asessment, generating prompts, executing output, and readjusting prompts. These agents could conceivably work on long term, complex projects with minimal supervision or interaction. If a general purpose system can get a foothold and be even 10% as good as a person, it could be enough to fundamentally reshape the workforce.


One thing I find it really great for is language learning. It can create examples, explain grammar, contrast with languages you already know, etc. Also, I'm not a sysadmin but occasionally have to do sysadmin things, and I can ask for the shell commands I need in English; it is far, far faster than using Stack Overflow etc. (i.e. pulling logs from multiple servers and extracting bits). Also, if I want to use a library, I describe how I want to use it and get a small example entirely tailored to me.


My girlfriend and I use it to turn a short bullshit paragraph of corporate lingo into a bigger one. It's great for that, but that's it.

GPT will only ever be a good writing support tool.


In the content space. There's whole publishing industries that are at risk from GPT style models.

Would Reddit or HN still be interesting if bots that talked in the popular tone of the subreddit or thread dominated the conversation?

AI's effect will be like pollution to our culture. It'll speed up and help a lot of things and be very helpful generally. But it will create problems in 'human' spaces.


> I’ve been shown some neat pictures people made that they thought were cool. I don’t know that I need this every day.

3d and motion and this becomes the holodeck, you describe a scene and it creates it direct to your vr goggles.

Generative AI that can take in a scene and alter it rather than fully create it becomes real augmented reality from sci-fi: not just a HUD or greenscreened in elements, major transformations.


There is no killer app because ChatGPT is largely useless. Its results might be right, half right, half wrong, or totally wrong. All it's good for is spouting bullshit. It's like talking to a really outgoing, really confident used car salesman.


I’ve got a codebase written in the extreme programming paradigm that has become a linchpin for operating our enterprise.

I can fire up an AI to build out in 10 minutes the missing unit tests that would take a developer 3 weeks to write.

Scale that across 155 different projects, and that's over a year of development time in about 24 hours of compute.


Imagine a code/programming AI that I can ask to review my code, find my mistakes, and write unit tests for my new functions.

I am tempted to write a small program to fix my Internet comments and my bad English expression. I've done some tests, and ChatGPT finds the places in my comments that can be improved.


Instead of looking at use of an LLM in isolation, consider the value it could have in the future, combined with API's, other models, speech interface, etc.

Sure, it can help you write an email now, but the real magic could be when these things come together in a symphony of intelligence.


Yeah it is tiring, currently.

But soon people will just enter one goal, "make me money", and the program will run in a loop, pausing only when making an important decision to get the owner's approval.

Also, the input buffer of LLMs is increasing. Soon we will be able to begin with "write me a full 3D game".


> In fact, no one in my extended sphere of friends and family has asked me anything about chatGPT

A couple of weeks ago they wrote an episode of South Park with ChatGPT (and it was about ChatGPT). It's definitely gone mainstream amongst students who are using it to do homework.


I use it for APIs. Specifically, I use it to help think about how I should architect my APIs, as well as how I should implement them. I can ask ChatGPT to do both things for me. Even if it's somewhat wrong, it still gives me a good enough direction to go on.


Almost all office work is just reading words in boxes and looking at pictures and then writing words in other boxes.

There's insane potential to automate it. Think b2b, not b2c.


“Almost all” makes me question how much experience you have doing those jobs. They exist but work which is truly that rote has been getting automated for half a century so what I’ve found is that what you see now tends to deceptively appear simple while hiding judgement, uncertainty, and inference.

That’s especially true for things which involve liability. If Google builds a system to recommend YouTube videos and ads, it’s a win as long their error rate is below a certain level. If it’s your insurance company rejecting claims, however, people can die that way and the lawsuits for breaking contracts or legal standards can far exceed the believed savings.


The killer app is that it's glue between natural language and code. It's not realized yet, but it's gonna be BIG. It's simply gonna be Copilot for everything.


Have you heard of "editing"? Or "translating"? Those are entire industries that can now be almost entirely automated with no loss of quality.


It replaces googling. All the hard work of finding relevant info and piecing it together is done for you. For programmers it's just a faster stackoverflow.


I have a group of nontechnical friends who use Midjourney constantly for D&D and similar games (both brainstorming and visuals for players)


Ask college kids. They’re all using it for everything.


For a non native English speaker it's amazing. It can take any very broken blob of text and turn it into perfect English.


I use it to plan meals and make grocery lists each week. It's very good at that task, and it saves me a lot of monotony.


> I’m asking what people want the models to do every day, all the time.

Rule 34. When in doubt, always defer to rule 34.


Me? It takes the SQL for dozens of dbt models and spits out the schema YAML so I don't have to.


Parent article said the pressure was "be the first or be the best."

The proper answer is "to be correct."

Thousands of Africans sat at their desks, trying to achieve correctness.

This will become a (correct) commodity, but not today.

(Unless you are really good at differential equations.)


Scrolling back through some of my discussions with ChatGPT and a few with Bing (the later ones more likely to be GPT-4):

* Explaining what a do/while(false) loop was for as I'd not seen that construct before

* Discussing what DIDs were and how web DIDs worked as the RFC was very detailed in a lot of areas I just didn't care about. A discussion with a pretty knowledgeable person in the area was what I needed, and what I got. It explained what the well-known part was, explained my confusion around resolving to a document and resolving to a resource (where I was mentally stuck).

* Creating a learning plan, diagrammed by mermaid, for progressing in bouldering. Each major step broken down into sub-parts to practice

* Finding https://en.wikipedia.org/wiki/Bloom%27s_2_sigma_problem given bad explanations that didn't lead me to the right place in google

* Finding https://blockprotocol.org/ given a shaky memory of "some composable ui framework by someone famous" iterating a little with bing. I had failed to find it before manually.

* Explained and created a table of bouldering gradings as the place I go uses a different one to the videos I see

* Discussed project ideas to do with my son; gave me great ideas around electronics that I think are a good fit, as well as a few other things. The most useful part here was being able to say "that's too generic, I need proper projects" and "that's too simple for him" and have it update. It then also created some good explanations at different levels about how radios work.

* General discussions about long term impacts of LLMs, potential use cases

* Career advice

* Generating art that we'll be getting commissioned for the house

* NER without any coding

* Generating ember templates, CSS and example filling data for a custom framework given a problem statement (what I'm actually building right now)

* (edit) I just took requests from my kids and made them some colouring in pages with robots, firefighting robots, lego ninjas, owls, frogs and crabs.

> In fact, no one in my extended sphere of friends and family has asked me anything about chatGPT, midjourney, or any of these other models. The only people I hear about these models from are other tech people.

I've had family members tell me they've used it to create reports, and used it to create marketing copy, a website and lecture slides for others.


Needed to safely move a bulk carrier on short notice recently:

Q: Describe turning 180 degrees and continuing downstream with the current in a heavily loaded bulk carrier, from a stationary position tied to a dock with five knots of downstream current, using two 50-ton bollard pull tractor tugs. Be very detailed and specific in describing the actions required to complete this action safely.

  A: To safely turn a heavily loaded bulk carrier 180 degrees and continue downstream with the current from a stationary position tied to a dock with five knots of downstream current, using two 50-ton bollard pull tractor tugs, you must follow a series of steps. These steps are designed to ensure a smooth and controlled maneuver while minimizing the risk of collision, grounding, or damage to the vessel, dock, and tugs.

  1. Preparation: Prior to beginning the maneuver, ensure that all necessary communication equipment is functional and tested. This includes VHF radios, signal flags, and/or lights. Establish communication protocols and channels with the tugs and any other relevant parties, such as pilots and harbor control. Additionally, check the weather forecast, tidal information, and any local regulations or restrictions that may apply.
 
  2. Positioning of tugs: Position one tug at the bow and the other at the stern of the bulk carrier. The tug at the bow will primarily be responsible for pulling and steering the vessel, while the tug at the stern will act as a brake and assist in maneuvering the vessel as needed. Securely attach the towlines to the designated towing points on the vessel.

  3. Cast off lines: Once the tugs are in position and ready, instruct the crew to release all mooring lines from the dock. As the lines are released, the tugs will need to maintain tension on their towlines to keep the vessel in position and prevent it from drifting downstream with the current.

  4. Initiate the turn: The tug at the bow will start the turning process by pulling the vessel's bow away from the dock and into the current. At the same time, the tug at the stern will push the vessel's stern towards the dock to create a pivoting motion. Both tugs must continuously adjust their power and positioning to maintain control of the vessel as it turns.

  5. Manage the current: During the turn, both tugs must work together to counteract the effects of the five-knot downstream current. The tug at the bow will need to pull harder to overcome the current, while the tug at the stern will need to push and also act as a brake to prevent the vessel from gaining too much speed.

  6. Monitor the turn: Throughout the 180-degree turn, the vessel's crew and tugs must maintain constant communication to ensure a smooth and controlled maneuver. Monitor the vessel's position, speed, and proximity to the dock, other vessels, and any navigational hazards. Adjust the tugs' power and positioning as needed to maintain control and avoid collisions or grounding.

  7. Complete the turn: As the vessel approaches the completion of the 180-degree turn, the tugs will need to adjust their power and positioning to slow down the vessel and align it with the downstream current. The tug at the bow will ease its pulling force, while the tug at the stern will continue pushing until the vessel is properly aligned.

  8. Proceed downstream: Once the vessel is aligned with the current and moving downstream, both tugs will gradually reduce their power to allow the vessel to continue under its own momentum. The tugs will remain close by to provide assistance as needed until the vessel is safely clear of the harbor and any navigational hazards.

  9. Release tugs: When the vessel is at a safe distance from the dock and navigational hazards, and under the control of its own propulsion, the towlines can be released, and the tugs can return to their base.



