WebAssembly, an executable format for the web

dchest · on Dec 11, 2017

    function readUtf8String(memory, pointer) {
       let s = "";
       for (i = pointer; memory[i]!==0; i++) {
          s += String.fromCharCode(memory[i]);
       }
       return s;
    }

This doesn't read UTF-8, this reads bytes and converts them to UTF-16 char codes, so it works only with ASCII or some other 0-255 encoding. To read UTF-8 into a JavaScript string (which is UTF-16), bytes should be properly decoded from UTF-8 (e.g. with TextDecoder).

arnoooooo · on Dec 12, 2017

Thanks ! I've renamed the function accordingly.

chrismorgan · on Dec 12, 2017

ASCII only covers values 0–127; this function allows 128–255 as well, which makes it ISO-8859-1 (latin1), the first 256 code points of Unicode being the entirety of ISO-8859-1.

kodablah · on Dec 11, 2017

I'd like to think one day that the non-web side of WASM could become a more deterministic and cross-platform-predictable LLVM IR. Right now it is too limited of course but as GC and threading specs come along, I can hope one day it will reach parity. Granted it isn't as low level, but that has benefits for compiler backend implementers (they can choose how they target their arch) and compiler frontend implementers (they don't have to do different things per platform). Basically, imagine if Java bytecode was developed less around OO semantics, wasn't lumped in with its stdlib by the community, didn't have encumbered JCK and other tests, etc, etc.

Edit: Changed "LLVM" to "LLVM IR" to clarify as requested.

RussianCow · on Dec 12, 2017

To me, this is the most exciting prospect of WebAssembly by far: an IR that works natively across all major platforms, including the web. It's funny because it makes Gary Bernhardt's "The Birth & Death of JavaScript"[1] seem a lot less like a joke.

[1]: https://www.destroyallsoftware.com/talks/the-birth-and-death...

mike_hearn · on Dec 12, 2017

I'm not sure your criticisms of Java bytecode are all that reasonable to be honest.

You don't have to use the Java standard library to target bytecode. Scala, Ceylon, Clojure provide their own stdlibs. Indeed not doing this is one of Kotlin's competitive advantages.

Although the JVM wants to think in terms of objects, this is only really mandated as a unit of linkage and garbage collection. You could equally criticise C runtimes for being too oriented around procedural programming semantics because they see the world in terms of top level functions and global variables. You can write non-OOP programs in Java bytecode and Eta is an example of using the Haskell Spineless Tagged G-Machine representation on top of the JVM. If your runtime doesn't have any concept of an object-like thing, then it gets rather hard to do garbage collection and certainly any kind of moving GC.

Finally, the JCK is encumbered to protect the Java trademark. We can argue about whether that's really been so useful given Android, but the principle makes sense - it's about ensuring that if someone claims they run Java bytecode, they actually can run it. If WebAssembly isn't trademarked and protected in the same way then it'll be a repeat of HTML5, where a claim of support is essentially meaningless and everyone has to constantly consult giant feature/bug tables to figure out what subset of the spec works on your particular implementation.

Now it may be that in the end everyone is OK with the word WebAssembly being more of a statement of intent than a precise claim, just like we muddle through with the word HTML5. And in reality we all know that only Mozilla, Microsoft, Google and maybe Apple will implement wasm or at least only these companies will have implementations anyone cares about. They will probably comply with whatever test suite is produced without needing legal/trademark incentives. Maybe.

titzer · on Dec 12, 2017

> Now it may be that in the end everyone is OK with the word WebAssembly being more of a statement of intent than a precise claim, just like we muddle through with the word HTML5. And in reality we all know that only Mozilla, Microsoft, Google and maybe Apple will implement wasm or at least only these companies will have implementations anyone cares about. They will probably comply with whatever test suite is produced without needing legal/trademark incentives. Maybe.

There are already projects to implement WASM in other contexts beside the web--some even in production. The specification, which has been entirely mechanized with an implementation of a provably-correct interpreter in Isabelle, is designed to be implemented easily. We welcome other implementations of the core execution spec of WASM and other embeddings. I can only speak for myself as a co-founder, but I believe most people involved in the W3C groups hope the scenario laid out in this comment does not happen in reality.

mike_hearn · on Dec 12, 2017

Yes, but all those things are true of other platforms as well. The JVM's type system has been proven correct. Everyone who writes specs designs them to be easily implemented (in their view) and everyone hopes the spec won't be forked by vendors or end up with widely deployed but buggy implementations.

The worst case scenario for wasm or any spec author is that someone comes along and either implements it very poorly, or adds vendor extensions that effectively become de-facto standards. E.g. Chrome might add new opcodes or APIs that rely on other Chrome-only features. And then people start claiming they've got a wasm module, but it doesn't work properly or well enough on other implementations.

That's what happened to the web, to Java and others, and is one reason why the TCK is "encumbered", as the OP put it. It's also why Android requires you to pass an encumbered test suite to be able to use the trademark/logo/get access to the Play Store etc. It's not like one path is inherently superior to another. Protecting a trademark by licensing the test suite and mandating it be passed is about risk management.

titzer · on Dec 12, 2017

> The JVM's type system has been proven correct.

I'm not sure which proof you are referring to, but the Java and Scala languages both have unsafe type systems:

https://2016.splashcon.org/event/splash-2016-oopsla-java-and...

(Admittedly this is due to parametric types which are erased).

The JVM bytecode verification algorithm is over 100 pages in the spec.

> E.g. Chrome might add new opcodes or APIs that rely on other Chrome-only features.

We aren't doing that currently. We plan to continue participating in the W3C processes which have worked well over the past 2+ years.

mike_hearn · on Dec 18, 2017

I'm talking about the JVM type system, not Java's. They're not quite the same due to generics erasure and a few other things.

We aren't doing that currently

All browser makers have long histories of forking the platform with proprietary extensions more or less whenever they feel like it. So what does this assurance mean? Indeed that's now how HTML5 is meant to evolve! Nobody even uses the W3C anymore do they? It's all just throwing whatever extensions get built by browser makers into HTML5.

I'm sure WebAssembly will have its share of compatibility matrices and vendor-prefix equivalents soon enough.

kodablah · on Dec 12, 2017

The thread's a bit old, but I'll reply anyways.

> Scala, Ceylon, Clojure provide their own stdlibs.

On top of the JVM ones when targeting the JVM. E.g. java.lang.String (Kotlin too)

> You can write non-OOP programs in Java bytecode

Everything is still an object and all code is in methods. There is no way around that. Also, there are many other things like forced GC and what not that marry the bytecode to certain paradigms.

> it's about ensuring that if someone claims they run Java bytecode, they actually can run it.

And about their impls of java.lang and other base packages. And a way to make sure that licenses that Oracle doesn't like, such as non-OpenJDK-derived open source stdlib impls, are not given the tests. Other languages with compat tests publish them for all and yet aren't worried about losing their name. This is dangerous reasoning to apply to keeping things opaque.

It isn't just those big companies implementing WASM. I implemented a WASM backend for the JVM [0]. The core test suite, which I use and comply with 100%, is available to all [1].

0 - https://github.com/cretz/asmble/

1 - https://github.com/WebAssembly/spec/tree/master/test

wyldfire · on Dec 11, 2017

Your comment was confusing to read. I think it would be easier to understand if you replaced "LLVM" with "Intermediate Representation".

rhelmer · on Dec 12, 2017

This use case is interesting to me too. I looked into this a few months ago and found that there is a page on non-web embeddings:

http://webassembly.org/docs/non-web/

stromgo · on Dec 11, 2017

Those of you who tried WebAssembly, what kind of performance are you getting compared to native? In my application, execution time is about 200% of native, which is quite bad compared to the few benchmarks I could find: https://hacks.mozilla.org/2016/10/webassembly-browser-previe... https://blog.acolyer.org/2017/09/18/bringing-the-web-up-to-s...

It's a head-scratcher as my code is nothing special and I don't see what I could be doing wrong. Then it occurred to me that these benchmarks look suspicious, with many scores below 100%. Are there fair benchmark results somewhere, measured by someone not trying to promote WebAssembly? What performance ratios are HN readers getting currently, and what performance ratios should we expect in the long term?

vanderZwan · on Dec 11, 2017

Do you have your code online somewhere? What kind of tasks are you trying out? How much time is spent sending data back and forth between WASM?

jdright · on Dec 11, 2017

https://floooh.github.io/sokol-html5/index.html

You can compile these samples natively and compare yourself.

steveklabnik · on Dec 11, 2017

I haven’t done much personally yet, but I hear it can vary quite a bit between browsers.

tux1968 · on Dec 12, 2017

Am I being paranoid to think this is going to be heavily used to introduce proprietary and locked down protocols and content? How long is it until this is used to render web content onto a canvas without using individual DOM elements that can be stripped by ad blockers etc.

We're all focused on the potential performance rather than what this will actually be used to deliver -- so maybe i'm just missing what prevents this technology from subverting the open web.

klodolph · on Dec 12, 2017

"The open web" is a platform where everyone can publish and access content because the standards are open. WebAssembly makes the web more open because it puts nails in the coffin of Flash, Unity Player, and the like.

Back in the day we had Flash-only web sites. They died off because it turns out that it's easier to just throw HTML, CSS, and JS at people, and since these were open standards, there was a fair bit of competition between different vendors to make better tools for consumption, authoring, and delivery.

If you want to reimplement that experience in WebAssembly, the ROI is going to be horrible and if you render everything to a canvas your site won't be indexable by Google (goodbye traffic) or workable with screen readers (hello lawsuits).

Someone · on Dec 12, 2017

I see that change as soon as DOM bindings for WebAssembly are available.

Apple/Google/Microsoft would be stupid if they weren’t already working on a Swift/Kotlin/C# to WebAssembly compiler, as each of them would give them for their preferred language what JavaScript now has: the ability for developers to work in a single language, and move code between browser and backend at will.

Long-term, JavaScript is dead.

icebraining · on Dec 12, 2017

That already happens, since you can already write compilers that target JS: Google wrote GWT and Dart. Microsoft seems to have gone the other way, pulling JS into native apps.

Someone · on Dec 12, 2017

That is true, but it still is on JavaScript’s turf. With WebAssembly the JavaScript will be as ‘native’ as any other language. I think the psychological difference matters.

alexasmyths · on Dec 12, 2017

"WebAssembly makes the web more open because it puts nails in the coffin of Flash, Unity Player, and the like."

In practice, it does not.

It's too low level, and the APIs to access browser features are not available, so you have to go through seriously ugly APIs and just do it in JS anyhow.

So if you have to go to JS to do pretty much everything, why are we using WASM again?

Performance? Is performance the issue? Because if it is, isn't there maybe better ways of solving that problem?

It's certainly not for portability, we already have that.

To make 'web development available to C coders' ... Not sure that's it since they will invariably have to know web browsers anyhow ...

WASM is a great idea that won't gather steam until there are direct APIS. Which is on the roadmap I think.

I'd suggest just trying to build a simple WA, or even trying to make a simple app out of WA. Things look different then.

It's a nice dream to have, we're still a ways off.

klodolph · on Dec 12, 2017

This is a straw man argument. WASM is obviously not a compelling alternative to JS. There are few people crazy enough to write web apps in C. Is that the "nice dream to have"?

However, I can think of two very compelling use cases for WASM.

As an addition to JS, it gives JS programmers access to a wealth of libraries and programs written in other languages. Codecs and compression algorithms, linear algebra libraries, image and audio processing, and a million other things have good libraries that just don't happen to be written in JavaScript. Adding a WASM blob as a library you can call from JS is a huge success. Suddenly you can do tons of client-side processing that was too prohibitive before (from an engineering cost standpoint).

As an alternative to Flash and Unity Player, you can write games for the web using WASM without having to deal with JavaScript.

Having used WASM personally, I can say with conviction that it is a compelling addition to the open web.

alexasmyths · on Dec 12, 2017

"lash and Unity Player, you can write games for the web using WASM without having to deal with JavaScript."

No, you have to go through a seriously crazy api wherein data is passed into a single managed array in wasm, and pulled off in JS and visa versa.

Ok - so accessing C libs is useful?

Are bindings from other languages to C considered to be really an important things for those languages?

Was there a specific set of libs that we were angling for?

Does it even make sense to do those things in the browser?

And what about the limitations - i.e. having to pass data into and out of the WASM container - does that play well with usage of 'codecs'?

Maybe not.

So why again does WASM exist?

What as the compelling use case?

Flash/UX is definitely not it.

I don't see any real uses cases for things like codes, nor was I seeing demand.

So where is the market/product need? And use case?

This is cool tech that's a long way off from being useful in the real world.

Having gone through the pain of writing my own little app, I realized that it's still fairly bleeding edge. I will go back to it when there are deeper APIs between JS and C, are the toolchain is more stable.

klodolph · on Dec 12, 2017

> No, you have to go through a seriously crazy api wherein data is passed into a single managed array in wasm, and pulled off in JS and visa versa.

That's false. When writing a game, you interact with APIs in your implementation language (e.g. SDL).

> Are bindings from other languages to C considered to be really an important things for those languages?

Yes, it's a fairly critical feature, which is why successful languages almost universally have a C FFI. Some languages have several FFIs to choose from, and others can even integrate with the C preprocessor. So many libraries that we use on a daily basis are written in other languages than the languages we use, and that's just how modern programming works.

> So where is the market/product need? And use case?

> This is cool tech that's a long way off from being useful in the real world.

It will happen faster than you think. Some JS developer somewhere will want to use a native library. They will want to use it badly enough to port it to WASM, write a JS wrapper interface, and upload the whole thing to NPM. Suddenly, a C++ developer with no knowledge of JS will be delivering features to front-end developers with no knowledge of C++, all thanks to some third developer who put in the effort for the WASM port.

Which is how it already works everywhere else, so it shouldn't be a surprise. For example, you can use SDL from C#, even if you don't know C and the SDL developers don't know C#. All because some guy with a bit of knowledge about P/Invoke figured out how to copy data in and out of the C# runtime to make some SDL bindings, just the same way you'd have to do it with WASM.

I mean, you can complain about problems with copying data in and out of WASM, but it already works that way with other FFIs.

alexasmyths · on Dec 12, 2017

"That's false. When writing a game, you interact with APIs in your implementation language (e.g. SDL)."

Ultimately - information must not only be passed through an ugly memory-reference API, but anything that happens on the screen, must happen in JS.

Ok, sure, you can abstract it all away, but that doesn't change the fact of the matter.

It would be more powerful and efficient to be able to do stuff natively inside the container.

" Some JS developer somewhere will want to use a native library. They will want to use it badly enough to port it to WASM, write a JS wrapper interface, and upload the whole thing to NPM. Suddenly, a C++ developer with no knowledge of JS will be delivering features to front-end developers with no knowledge of C++, all thanks to some third developer who put in the effort for the WASM port."

Agreed - but I still don't think we are quite there yet.

fourthark · on Dec 12, 2017

> Ok - so accessing C libs is useful?

Yes.

> Are bindings from other languages to C considered to be really an important things for those languages?

Definitely!

> Was there a specific set of libs that we were angling for?

No, there are too many to count. For example, I am looking forward to using network visualization libraries that no one here has heard of. They would be very difficult to port from C++ to JS.

> Does it even make sense to do those things in the browser?

Yes.

wruza · on Dec 12, 2017

Are lawsuits like that a real thing in EU? We’re considering to make ui, at least hard part of it, in a canvas for new web-based projects. Of course, there is no intention to discriminate people with sight issues. I think we could provide different/better interface to cover this area.

Besides that, are modern sites actually accessible in that regard?

klodolph · on Dec 12, 2017

No idea about the EU, but lawsuits under the ADA (Americans With Disabilities Act) are real in the US.

And yes, modern sites really are accessible. Screen readers traverse the DOM looking for the semantic information that's already there. Look at WCAG (https://www.w3.org/TR/WCAG/), you'll see how easy it is to follow the guidelines. Unless, of course, you stick a bunch of stuff in a <canvas>...

spectredev · on Dec 13, 2017

The ADA hasn't held up to digital, see Access Now v. Southwest Airlines

wruza · on Dec 14, 2017

Thank you, this example really makes some important details clear.

api · on Dec 12, 2017

This is already possible using emscripten without too much effort. Emscripten JS is basically binary code. It's also been possible with Flash for over a decade, though fewer and fewer people have that installed.

brucephillips · on Dec 12, 2017

Tellingly, this is effectively the "dumb pipe" version of the web that Alan Kay so voraciously advocated for.

tqkxzugoaupvwqr · on Dec 12, 2017

Gives me nightmares. But today it’s already possible and I haven’t seen this yet. Ad supported industries haven’t moved to canvas-only websites to circumvent ad blockers. I hope this is a predictor of the future.

jopsen · on Dec 12, 2017

I find it unlikely... You would effectively have to implement a browser in webassembly, as people will expect to be able to select text etc.

The degration in detach indexing and user experience, load time and battery usage would quickly make a first mover loose users.

hughes · on Dec 11, 2017

I'm admittedly inexperienced with WebAssembly, but I'm a little unclear on going beyond the basic "hello world" implementations.

Is the idea to write an entire application in WebAssembly, or is it to break out parts of the application that benefit from performance improvements and potentially cross-platform code bases?

Also, how do you do things like make network requests or decode JSON in webassembly? Do you need to implement these things yourself, or pass things back to the browser to handle them, or find C libraries that do that for you?

kodablah · on Dec 11, 2017

Case by case situation. WASM is nothing but a spec and impl for CPU tasks. If you want to do something like networking, you call out to JS (today). If you want to decode JSON, you could call out to JS, or populate some part of the WASM-accessible mem (a byte array) and then ask WASM to operate on it. It could be a C library compiled to WASM or hand-written or in some other compile-to-WASM lang.

skocznymroczny · on Dec 11, 2017

WebAssembly might be a Javascript killer for high-performance uses. Javascript for uses such as game development is dragged down by the garbage collector, forcing you to orient entire code around minimizing allocations. Without garbage collection, you have to manage memory yourself, but allocations aren't as scary as they are with GC languages and they are more explicit.

vardump · on Dec 11, 2017

Garbage collection contributes pretty little to Javascript's performance issues. Unless you deal with some allocation happy data structures, such as linked lists.

vanderZwan · on Dec 11, 2017

What you say is true for regular websites, but game code has different needs than the type of JS you normally use in websites.

I've seen people do some very naive stuff, like:

    var particle = {
      color: randomColor(),
      position: new Point(x, y),
      // number of frames the particle lives
      decayIn: 50 + Math.ceil(Math.random()*100),
    };

... for very short-lived particle effects.

This does not scale if you want to draw thousands of particles that each last a few seconds.

If they were smarter about it, they could create a pre-allocated "pool" using a maximum nr of particles and re-use them.

Likely to be even faster would be preallocating one Uint32Array (for color), two Float64Arrays (one for x and y), and one Uint8Array (for decayIn), and then use an index to look up an individual particle's color, x, y, and remaining frames.

Anyway, what I'm saying is: idiomatic JS definitely can result in garbage collection-related bottlenecks for game development, but a lot of this can be avoided by using non-idiomatic coding styles (like heavy dependence on TypedArrays).

vanderZwan · on Dec 11, 2017

> but a lot of this can be avoided by using non-idiomatic coding styles (like heavy dependence on TypedArrays).

Too late to edit, but this really should have finished with:

... at the cost of code that is more verbose yet the opposite of self-documenting, inherently frail even if you put wrapper-functions around those typed arrays, and likely to require massive changes all over the code-base for the smallest updates.

In my opinion, unless you have a really good reason to not use alternative options like WASM or Emscripten, trying to squeeze that kind of performance out of vanilla JavaScript it's unlikely to be worth the technical debt.

austincheney · on Dec 11, 2017

GC is the primary performance killer for persistent CPU intense applications, which is why there aren't many such applications written in JavaScript. The thread is locked when GC happens.

In most other JavaScript applications GC is hard to notice, because it typically runs once the application has finished executing.

vardump · on Dec 11, 2017

CPU intensive applications don't usually allocate that much.

Excessive allocation makes also other languages, such as C++, very slow.

imtringued · on Dec 12, 2017

GC is not a problem for CPU intense applications.

The fundamental problem with a GC is that all threads must be stopped to perform the garbage collection.

GC pauses are a problem for realtime applications because an unpredictable GC pauses can blow your time budget of 16ms (or even less) per "tick". The only way to avoid this is by not allocating anything during the realtime segments. Since the GC "infects" your entire program you are also not allowed to allocate anything during the non realtime segments. So you might as well just write C++ code.

If the heap is large enough (several dozen gigabytes) the GC pauses may cause latency spikes that last more than a second.

fulafel · on Dec 14, 2017

See eg https://v8project.blogspot.com/2017/11/orinoco-parallel-scav... - some JS GCs are parallel and it's not the case that the "thread is locked" for the whole duration of the GC.

TekMol · on Dec 11, 2017

Is WebAssembly related to asm.js?

I don't like to compile stuff. But improving performance by writing certain functions directly in assembler or in a subset of js would be awesome.

acemarke · on Dec 11, 2017

In a way, yes.

asm.js was a deliberately limited subset of JS syntax. By adding a "use asm;" statement, you signaled to the JS engine that everything after that was limited to the asm.js subset, allowing it to simplify the parsing logic and use a fast path for loading the code. You would rarely write asm.js syntax by hand, but rather use it as a compiler output format.

WebAssembly takes that further by defining a bytecode that the JS engine doesn't even have to parse, but can accept and work with more directly. This speeds up loading time. Also, loosely put, the way it's defined allows the JS engine to execute it faster.

WebAssembly is definitely not intended to be written by hand. Languages like C++ and Rust can now compile to WebAssembly, allowing non-JS source code to be run in a browser. (There _is_ a textual representation of the bytecode as well.)

brian-armstrong · on Dec 11, 2017

Compiling stuff is awesome. I never want to discover my bugs at runtime.

goatlover · on Dec 11, 2017

Even when prototyping, exploring or doing one-offs?

I'd rather have the flexibility in those situations.

brian-armstrong · on Dec 11, 2017

Absolutely, yes. It's preferrable to trying to debug inexplicable glitches in the prototype that the compiler could have caught. And with incremental builds, compilation should be pretty lightweight anyway.

wvenable · on Dec 11, 2017

I'm not sure why it makes a difference. If it doesn't work when statically compiled it won't work when dynamically either. Given things like intellisense prototyping and exploring should be faster.

giancarlostoro · on Dec 11, 2017

Yes:

https://en.wikipedia.org/wiki/WebAssembly#Development

thephyber · on Dec 11, 2017

Can someone explain to me why we would expect WebAssembly to be much different than Java Applets?

There are inevitably defects in the design and/or implementation which will open up browsers to gaping security vulnerabilities until they are found and fixed.

AgentME · on Dec 11, 2017

WebAssembly only uses the same web APIs as javascript, has generally been implemented using code already used for supporting javascript, and was designed from the ground up for this use inside the web sandbox, as opposed to Java which was designed in the same way as most language runtimes based around unsandboxed access to a system and then had a security layer grafted onto it.

alexasmyths · on Dec 12, 2017

#1 issue with applets was integration. Was it on the browser? Which version? How does auto-update work? etc.

Adobe did an amazing job of making sure Flash was there, and seamless. To the point people barely had an awareness.

Java Applets failed because Sun was a 'debating society' (per James Gosling). Definitely not a consumer oriented company, thinking about end user behaviour etc..

Other reasons as well, but that's #1.

Source: me, deployed a bunch of Enterprise projects on Applets back in the day.

nipplesurvey · on Dec 11, 2017

operating from the premise that most applet exploits functioned by elevating a sandboxed applet out of its sandbox (no idea if thats accurate), perhaps because java applets had an un-sandboxed mode built in, whereas wasm is always sandboxed and has no equivalent to applet's un-sandboxed mode.

mike_hearn · on Dec 12, 2017

It's a bit tricky. I don't know if you can categorically say wasm is or will be more secure than applets. Both of them rely on a runtime that is more privileged than the code being executed, of course. Both of them can have runtime bugs that allow privilege escalation of various forms. There was a recent perma-root bug in ChromeOS and one of the exploits along the chain was based on exploiting a WebAssembly runtime bug.

If you look at Java's security track record in recent years, there's been just a couple of zero days in the past five years. That's probably a mix of genuinely better security and less attention due to being kicked out of web browsers. But if you go look at the security histories of other sandboxes like browsers or kernels, it starts to look pretty good. A new paper that just came out introduced a new Linux kernel fuzzer and discovered, I think, over 30 zero day exploits across several different Android phones. So ordinary UNIX style process isolation is pretty useless if you can reach device drivers from inside the processes. Even the quite aggressive and resource intensive browser sandboxes that browsers use routinely have escapes, often because they're always adding large new attack surfaces like WebAssembly, WebGL etc (all written in C++).

nipplesurvey · on Dec 12, 2017

very interesting, did you read about the linux kernel fuzzer and the ChromeOS root here or elsewhere? regardless if you can provide a link either sound like very interesting topics.

LyndsySimon · on Dec 11, 2017

This is the first time I've seen disassembled WASM. It's actually quite readable!

danschumann · on Dec 11, 2017

Is there a way to compile javascript in web-assembly, to protect the source code?

IE, you'd have to convert the javascript to C, and then compile the C into web-assembly, right? Does V8 have a method to convert JS to C?

s-macke · on Dec 11, 2017

This repository compiles asm.js into webassembly [1].

Otherwise, you will have a hard time to convert your Javascript code. You have to implement a JIT compiler and GC too. Webassembly was never designed to be a replacement for JavaScript. And especially, it was never designed to protect your source code. A minimizer/obsfucator does a great job of making your JavaScript code unreadable.

[1] https://github.com/WebAssembly/binaryen

baybal2 · on Dec 11, 2017

>And especially, it was never designed to protect your source code.

I hardly get that, all and every major backer of WASM did effectively admit at some point that "ip protection" was the one and only reason they back it.

The absence of a native dom interface or proper I/O implementation at this point means that WASM compiled code actually runs slower for everyday web apps than non-compiled code.

the_duke · on Dec 11, 2017

> all and every major backer of WASM did effectively admit at some point that "ip protection" was the one and only reason they back it.

I'm sure you wouldn't mind providing sources for this bold claim?

s-macke · on Dec 11, 2017

I have looked at obsfucated JavaScript code as well as webassembly disassembly. I don't see much difference. What might obfuscate is the fact, that it comes without any libraries. You have to program like in C without any library support, which is more or less what you do with bare metal programming now.

kuschku · on Dec 11, 2017

> Is there a way to compile javascript in web-assembly, to protect the source code?

This kind of obfuscation is harmful to the web, don’t do it. Also, any compsci student with a summer of free time can easily break it, so it’s useless, too. You’re just wasting your own time and the bandwidth and performance of your users.

In fact, if you tell me what site you want to obfuscate your code on with WASM, I (an average compsci student) will spend my next summer reversing it, deobfuscating it, and publishing the results publicly (just to demonstrate you how useless your obfuscation is).

danschumann · on Dec 11, 2017

Is photoshop harmful to the web? Should they give away their source code? Their source code is what's valuable in their company. So, art programs kind of need source protection. I won't be making a web-version until this is the case. If you can crack NW.js source protection, let me know http://docs.nwjs.io/en/latest/For%20Users/Advanced/Protect%2...

kuschku · on Dec 11, 2017

> Is photoshop harmful to the web?

Photoshop isn't on the web, but I oppose closed source in principle.

> Should they give away their source code?

Ideally, yes. If you do it is still your decision, but you still should be aware that if you introduce truly new concepts, someone will clean-room reverse them, and publish them.

> So, art programs kind of need source protection.

The Krita project begs to differ.

> I won't be making a web-version until this is the case. If you can crack NW.js source protection, let me know

Then you shouldn't make any build where the business code runs on the client. Even native binaries can be reversed, their functionality analyzed, and copied. Which is how LibreOffice/OpenOffice originally discovered how to parse the Microsoft Office formats.

danschumann · on Dec 11, 2017

It's possible, but not very ideal to have all the operations done on the server, and just serve interface stuff, maybe even as a streaming video, to the client.

When people paint, they want it responsive, no stream lag, no server hiccups. I must allow people to install the software on their computer. I would still like to protect the work I've done, because I'm just one guy making software, and I don't have marketing money. What would stop big companies from taking my source and spending more on advertising? I need source protection.

kuschku · on Dec 11, 2017

> What would stop big companies from taking my source and spending more on advertising? I need source protection.

If your threat model is a company with billions in of dollars in revenue, then source protection is useless.

NW.js protection can be cracked, too, requiring maybe one or two man-months more. For a student reversing it, this is annoying, but not a problem.

For GoogleAppleAdobeZon? They dont even notice the cost difference. You can be Snapchat, have unique features, and have it all obfuscated — and Facebook can still copy it all in weeks.

The question isn't "will Adobe be able to copy my app" — the answer to that is always yes.

danschumann · on Dec 11, 2017

Sure it's possible, but I'm trying to make it as hard as possible. Currently I think NW.js source protection is the hardest.

I think there are times when open source isn't the best. I never would have spent so much time making art software if I didn't think I could make it reasonably hard to crack. Eh, maybe I would have anyway, but I might have just made another mindless web app that wasn't difficult to code.

I think there are definitely cases where closed source works, especially when certain algorithms are easy to read but hard to contrive.

kuschku · on Dec 11, 2017

Well, NW.js is the strongest on the web.

For native binaries, such as games, currently the strongest solution is Denuvo, from a team that formerly implemented copy protection for Sony. Decades of engineering went into it, at the risk of significantly slowing down the game, using several layers of obfuscation (including VMProtect), all wrapped around the existing, older obfuscation schemes, for a native binary.

The first version of it took a few hobbyists 6 months to break, by now games protected with it get usually broken within of one week or less by hobbyists.

That’s hobbyists.

If, as you truly say, your threat model is Adobe, or a similar corporation, your obfuscation is basically useless. If determined individuals can break some of the most advanced obfuscation schemes in days, how long do you think will it take Adobe to break yours, if they actually see you as competitor?

> I think there are definitely cases where closed source works, especially when certain algorithms are easy to read but hard to contrive.

They work short-term, but as soon as someone extracts the algorithm once, everyone has it. Especially considering that in most places algorithms are neither protected by copyright nor patents (including the EU).

Fifer82 · on Dec 11, 2017

Denuvo... I can't remember the game (it was shite) but it got cracked 6 hours before release. You have to laugh especially because it actually did create low FPS so all the actual customers were pissed off.

Same story since the Amiga Days

esrauch · on Dec 11, 2017

Do you believe that Adobe would lift algorithms from your non-open source JS if not for NS.js? Or are you worried about some other tiny company?

As someone who worked for a while on painting tech at a BigCo, the truth is that the chance that a BigCo would steal an algorithm from your one-man-paint-app's minimally obfuscated JS is extremely close to zero (excluding if it was MIT/BSD OSS license, and even then I would be pretty surprised if it happened). Anything half as novel as you are describing would be better leveraged by talking about some of it as siggraph and then acqui-hire selling your company for millions to a Adobe/Microsoft/Apple type.

It makes perfect sense to not open source it, but if you are trying to build this as a business any amount of time you second you spend trying harden your source from some hypothetical reverse engineering boogeyman (that is likely to never even be trivially attempted) is time that would be much better spent on marketing or improving your product.

imtringued · on Dec 12, 2017

>What would stop big companies from taking my source and spending more on advertising?

Almost every country on this planet has copyright laws.

If you're worried about them ignoring the law and taking your source code why are you not worried about them ignoring the law and physically stealing your computers?

justinpombrio · on Dec 11, 2017

If you just want to obfuscate your JavaScript, there's nothing to be gained by compiling to a different language. There are already plenty of JS->JS obfuscation libraries that work well.

_s47s · on Dec 11, 2017

NW.js offers Javascript code protection [0] by allowing to distribute code in bytecode form. I just tested it yesterday and my code runs as fast as the original.

[0] http://docs.nwjs.io/en/latest/For%20Users/Advanced/Protect%2...

skybrian · on Dec 11, 2017

That's unlikely to be efficient and it wouldn't help much anyway.

Running JavaScript through an optimizing compiler and minifier will obscure pretty much the same information. But either way, you can step through the code in your browser's debugger to see what it does.

dangerbird2 · on Dec 11, 2017

Also, wasm is a fairly high-level bytecode, more comparable to jvm or llvm code than arm or x86 code. It's pretty easy to follow the logic of disassembled web assembly

slededit · on Dec 11, 2017

WASM doesn't have access to the DOM yet so it's unlikely your JS could would work anyways.

icebraining · on Dec 11, 2017

Apparently http://enclosejs.com can compile JS to regular binaries, but I don't know if there's any way of converting an ELF binary to wasm. You could always run a full emulator + kernel (using JSLinux) to run that binary :)

_greim_ · on Dec 11, 2017

No. wasm needs to be able to interact with the DOM and other web APIs, which I understand isn't the case currently. Then you need hooks into GC. Although I guess nothing would stop you from embedding a JS runtime in your wasm blob, other than download size.

jwilk · on Dec 11, 2017

"#include " was meant to be "#include <webassembly.h>", but somebody forgot to escape <> in HTML source.

arnoooooo · on Dec 12, 2017

I thought I fixed that one :/ It's fixed for good this time. Thanks !

Thaxll · on Dec 11, 2017

ActiveX is back. Can't wait to see all the exploits escaping the browser sandbox.

cdancette · on Dec 11, 2017

Isn't it the same with JavaScript?

chungy · on Dec 11, 2017

Yes, and WebAssembly runs on top of the existing JavaScript VM engines.

Thaxll · on Dec 11, 2017

I don't think it runs on the same V8 ( chrome ) that the one runs JS, I remember 2 engines running.

DiThi · on Dec 11, 2017

The wasm binary itself may be using a separate engine, but heavily sandboxed: it can only communicate with JS, and nothing else. The (newly added) attack surface is minimal when compared with Flash, Java, etc (each with its own APIs). Almost any flaw must be in the JS side, and that's already battle-proven for years.

ActiveX didn't even have a sandbox, so not even related.

chungy · on Dec 12, 2017

The attack surface with classic NPAPI plugins (Flash, Java, etc) is practically unlimited. It was up to the likes of Flash and Java to secure themselves, not the browser. Various random plugins from other sources (eg, the QuakeLive and Unity plugins) may as well have cared about security _even less_ than the already abysmal track record of Flash and Java.

anomie31 · on Dec 11, 2017

Did ActiveX even have a sandbox?

jamesgeck0 · on Dec 11, 2017

Nope! But you had to promise Verisign that you wouldn't build any malware.

cm2187 · on Dec 11, 2017

I think it's more like a java addin. Which if properly sandboxed is a good thing.