I can't wait to see more affordable RISC-V microcontrollers on the market!
The Kendryte K210 looked very cool, especially with its SIMD-ish "machine learning coprocessor", but it felt like they had rushed the hardware to market without investing in scrutable documentation or software support, last time I checked.
The GD32V series looks fantastic, since the current crop of GD32VF103 chips appear to be API-compatible with the venerable STM32F103 workhorse. But I haven't been able to find a source of the raw chips yet; it seems like you can only get them on development boards at the moment.
And there are always softcores running on FPGAs, but those sort of highlight how many permutations of "RISC-V" exist. I hope that we don't end up with too many inscrutable compiler flags to juggle as more of these chips become available.
Not completely API-compatible. They are /extremely/ similar. GigaDevice did make an ARM STM32 clone, so they probably just did a ctrl-c, ctrl-v on the peripherals. Addresses are slightly different, and things aren't quite the same. I just ran into an issue with the system timer and its lack of documentation... But the good news is that they are pin-for-pin compatible, and you can put a raw chip onto a blue-pill board and use it!
You can get raw chips from Taobao. I used Taobao and a reseller, Superbuy, to get mine. Not bad at all!
Huh, what do you search for on Taobao to find them? I tried "GD32VF103CB" a week or two ago, but I only got results for the GD32F103 ARM clones.
I guess it makes sense that the addresses aren't quite the same; iirc ST has some licensing restrictions on their SVD/header files saying that you can't use them with other vendors' chips anyways.
Just note that you need a relatively new J-Link (v10 iirc) to program RISC-V cores using J-Link; alternatively, for the Sipeed Longan boards, pick up two of them and you can flash one with a provided debugging/uploading firmware.
Yes, the Longan Nano is very nice but I'm also interested in the bare chips. Seeed also sells cheap JTAG dongles that work with their RISC-V boards, which is nice.
I've been using a clone J-Link which works pretty good. I got it probably 5 years ago, so it's impressive that it works. I think I'm using a patched version of openocd with it though.
What's the appeal in using an STM32 clone? Are you building things in quantities where saving a few pennies is really worth the headache of all the subtle incompatibilities?
The appeal is that there is a lot of good example code showing how to use the communication peripherals, but the CPU core is RISC-V instead of ARM Cortex-M3.
Also they are rated for higher clock speeds and claim better power efficiency.
In lieu of the documentation, Kendryte has _a ton_ of code on their github, so you could use that if you're looking to work on something practical. For me, having a dual core risc-v, with _TPU_ and _DSP_ onboard for $8 in single digit quantities is absolutely nuts. I frankly didn't believe it and thought the spec sheet was fake, but I've since bought some real dev boards with this chip, and they work. Even if you don't need the TPU bits, just having dual-core risc-v is more than worth the price of admission.
I don't care about TPU, don't need DSP, but dual-core - yes, please do. Having a dedicated core for hard-real time and using the other for more complicated things is way better than single core. I've seen so many projects attempting to cover timing glitches from their single cores doing everything.
I don't understand why there aren't many multi-core offerings. It's not as if the silicon would cost a lot more, especially when it would be possible to downgrade the speed and/or size.
It certainly would be pretty awesome if there was a modification of this chip without a TPU but with an MMU instead, so it'd be capable of running a proper OS rather than FreeRTOS or bare metal.
Seems like he has been changing companies quite a bit recently. Is this typical for the VP level? Does anyone know how his tenure at Google was viewed by others?
Well, his time at Tesla was basically (publicly, IIRC) determined to be a bad fit between him and the company, so can't really fault him there. So since his long tenure at Apple he had a misstep at Tesla and a shortish, but certainly reasonable, length at Google. Would definitely not call that job hopping.
You seem to have the impression that a machine learning model is all you need to drive the car: this isn't true. There is a lot of real-time system programming and overall architecture.
> Andrej Karpathy basically threw everything out once he replaced Lattner.
What are you basing this on? I don't believe there is any truth to this.
First Chris was not replaced by Andrej but by Jim Keller and then Stuart Bowers.
Andrej is amazing in his field and has done impressive things at Tesla, but he is managing the ML part of the org, which was a relatively small portion of the Autopilot Software group overall.
That doesn’t really answer the question. “Not a good fit” is a bit of a euphemism in the industry - it typically is a polite way of saying “I didn’t get along with the CEO/CTO/leadership because X, Y, and Z” - and that’s what GP (and myself) are looking for. I doubt we’ll get an answer anytime soon - he’s a classy guy and probably still under NDA with Tesla.
With the benefit of hindsight, I'm really happy both that I joined Tesla and also that I left it. I have no regrets at all.
I learned a lot while I was there, but Elon and I had different opinions about some things. It was clear that he wasn't going to bend or change, and those points were important to me.
I have some principles that are extremely important to me, and if they aren't aligning, then it is best to acknowledge that and do something about it, than deny it and be frustrated or unhappy.
E is a force of nature -- for better and worse :-). I'm glad we have someone like him in the world, but that doesn't mean I want to be directly involved.
Any lessons on sticking to principles and how to make sure they align going into a project/role? I imagine it's hard to do so preemptively rather than when push comes to shove.
It doesn’t really matter - he’s no longer there and I’m sure much happier because of it. There are a lot of irrational things management can come up with. Sometimes the manager in particular is a sociopath or narcissist. Or maybe you unknowingly offended them in some way. At that point there’s no reason to stay, regardless of the reason.
His credentials put him in high demand. Due to this, he has the luxury of changing jobs in order to find his fit. Jumping can dampen the demand at some point, but it hasn't for now.
I don't think it's the absolute length of time, but rather the fact that the projects he was spearheading (Swift for Tensorflow/MLIR) are nowhere near production-ready (or at least they weren't the last time I checked).
2.5 years seems a very short amount of time to develop, launch and push adoption for low-level PL tooling (in an ecosystem that's not mature, but at least is reasonably developed). I would expect 5 years at a minimum.
Ah, I wasn't aware that MLIR was now under LLVM. I'm not heavily involved with the LLVM Foundation but the whole project seems to be stable and well run, so that makes me feel more comfortable about the future of MLIR.
Depends what skills you have and how hot your market is. I know some people in big metro areas like London or Bucharest switching every year without problem as their skills are in high demand.
In German-speaking countries that kind of job hopping isn't looked upon well, at all.
Anyone switching after a year is assumed to have barely made it past the six-month probation period, so many employers will think twice about hiring them, unless they are under pressure to hire someone.
German-speaking countries (apart from Switzerland) also don't pay their developers particularly well. Unlike the US/UK/Eastern Europe, they artificially cap salaries instead of letting the market dictate, so of course they don't like people switching jobs, since then they have to pay more to stay competitive. They have even managed to convince their developers that €80k is a top salary while their counterparts in Warsaw and Bucharest make that or even more. German/Austrian devs are severely underpaid.
Also, unlike the UK/US, they have this nasty habit of not mentioning their salary range/hiring budget in job ads, so they can lowball you during negotiations.
I look at it the other way around. They're probably talented enough to make it past probation, so their leaving was them deciding to leave, rather than not making it. That is, any investment you make in them, giving them slack to get up to speed in a codebase, you'll lose because they don't stick around.
I interpreted him going to Google as a place to learn a few new things while figuring out what to do next. He’s done some big stuff in the past. But even people like him need time to figure out what to tackle next.
Anyone who has been following his projects knows that 2.5 years is too little time for the problems he set out to solve and the projects he was leading (esp. with TensorFlow, MLIR, TF runtime, TPUs). So this is really a very early, abrupt and surprising exit, and several people both inside/outside know that he ran into political issues/struggles with high-level folks. Fortunately, one of the projects (MLIR) is already part of LLVM and will remain unaffected, but it looks like S4TF may just die out irrespective of what's being said. Otherwise, we may not hear much on the details given his "seemingly" reasonable length stint at Google.
He seems like the kind of developer that needs a real challenge to work on. When things shift into maintenance mode, he gets bored. That’s my take anyway. He’s a brilliant programmer. I’m glad to hear that he’s working on something really interesting.
I would avoid binning people into builders or maintainers; there are many more issues that could prompt one to leave Tesla and Google, e.g. work environment or ethical issues.
Swift for Tensorflow could never be taken seriously outside Apple community.
On Linux, Foundation barely works and one still needs to selectively do either import Darwin or import Glibc for basic IO stuff.
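To make that concrete, cross-platform Swift code today usually ends up with a conditional import along these lines (a minimal sketch; the print is just a stand-in for real IO):

    #if canImport(Darwin)
    import Darwin   // macOS / iOS system module
    #else
    import Glibc    // Linux system module
    #endif

    // Whatever libc call you need resolves against whichever module was imported.
    print("pid: \(getpid())")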
Then we are already at Swift 5.1, and the Windows version has to be built from source with lots of caveats.
How can it even be taken seriously against Julia, Tensorflow for C++, ML.NET all of which work across macOS, Linux and Windows as of today, and offer the same strong typing benefits?
While I agree there is an even chance that Google will allow S4TF to fade away after Lattner's departure, that is more of a reflection on the company having no commitment or consistency to good ideas: ex: https://gcemetery.co
However S4TF should be taken seriously if you understand what they are trying to accomplish and how deeply they designed machine learning support into the language. Take a look at fast.ai's new course offerings using S4TF (http://fast.ai). Swift has always been a long bet. If it doesn't work the way you want today, it is still short-sighted to discount it in the future.
With all due respect to Jeremy and Fast.AI (which I love and continue to recommend), I think it's fair to say they don't stick with things for a long time. I think the Fast.AI python library has been re-written 3 times in the past 3 years (Keras -> PyTorch -> PyTorch, differently).
The Swift core team just added two new members, whose contributions are related to Swift on Windows, and Swift on the server respectively, and have specifically called out supporting Swift in non-Apple domains as a part of the roadmap for Swift 6 (including adding the autodiff work from S4TF into mainline Swift). There is attention being paid to these issues.
Thanks for the heads up. I have been looking at that thread, but the roadmap still looks quite uncertain, with big questions regarding what Apple is willing to support outside their platforms, tooling (e.g. IDE support), and dependencies on WWDC announcements and deliveries.
I love the idea of Swift for Tensorflow : powerful automatic differentiation and a solid type system in a single language.
That's something that is not seen elsewhere and that would make it a perfect fit to write deep learning code targeting production systems.
Now the language needs two things in order to be safe from a hypothetical abandonment by Google:
- running smoothly on Linux (I thought it was already there, but your post seems to imply that it is not the case)
- getting the auto-diff out of the alpha stage so that people can build frameworks on top of it (fastai seems ready to jump on that, which is nice)
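For anyone who hasn't tried it, the appeal is roughly this kind of code (a rough sketch using the S4TF-style API; the exact attribute and function names have shifted a bit between releases):

    import TensorFlow

    @differentiable
    func loss(_ w: Float) -> Float {
        // toy quadratic "model"
        return w * w - 4 * w + 4
    }

    // The compiler derives the gradient; no tape or graph-building boilerplate.
    let g = gradient(at: 3.0, in: loss)   // d/dw of (w^2 - 4w + 4) at w = 3 is 2
    print(g)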
Having distinct Darwin or Glibc imports seems a little crazy for these examples. When you code in C, libc is just there. It has minor variations from platform to platform, but you always think of it as libc. (Even if Darwin calls it libsystem etc., or if Microsoft's is msvcrt and it kind of sucks.) It would be more reasonable to call it libc. (If I run on BSD do I still call it "glibc" with no actual GNU present? And if the arc4random APIs present in libc there look a lot like Darwin's because they actually share code? Maybe they have not considered porting to non-Linux?)
Basically they set a portability boundary in the wrong place.
But for context, the previous sentence on that page is:
“You can also import any available system modules, such as Darwin on macOS and Glibc on Linux”
The random number examples aren’t saying that’s the right way to generate random numbers in a platform-independent way, it is specifically demonstrating how to import system libraries on your local platform.
File handling can be done using Foundation, which I believe has API parity between Darwin and Linux at least.
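As a small illustration of that parity, something like the following should behave the same on macOS and Linux (a quick sketch using only Foundation; the path is just an example):

    import Foundation

    let url = URL(fileURLWithPath: "/tmp/hello.txt")   // example path
    do {
        try "hello from Foundation".write(to: url, atomically: true, encoding: .utf8)
        print(try String(contentsOf: url, encoding: .utf8))
    } catch {
        print("I/O failed: \(error)")
    }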
I think the point of the example was to demonstrate importing platform-specific modules, not random number generation per se. But you're right, it should probably be updated to do something else.
Not many use an actual C/C++ compiler (clang) to generate language-native calling-convention wrappers for calling into C/ObjC code (not sure how well C++ interop works these days).
Ah, that might be the reason why NVidia is adopting Ada for their firmware, ebay and Netflix decided to use D for some of their projects, the Turkish government decided to invest into Delphi for their education system,...
By the way, it is a list of languages with compilers that currently outperform Swift.
>Ah, that might be the reason why NVidia is adopting Ada for their firmware, ebay
A, the "some part of some huge player with 2000 divisions uses some otherwise niche language, surely said language is making a comeback" argument.
You can find all kinds of niche/sidestepped languages if you look hard enough on any organization that has 200 products, 1000 inside projects, and tons of engineers. Doesn't mean said languages are making a comeback anytime soon.
For languages that actually thrive nobody needs to enumerate major and minor projects where they're used, because they're too many to mention. But when the main news for "Planet language X" is "big corp decided to use X for something among the 100s things they do", well, they need all the straws they can grasp.
Same way Latin remains a dead language whether some Oxford professor recently published a book of new poems he wrote in it or not...
for what it is worth: a few days ago I installed the latest Swift and TensorFlow in Swift on both my little MacBook and my Linux laptop (1070 GPU, i7, fast laptop).
I had to manually set LD_LOAD_PATH on macOS, but then everything worked on macOS and Linux.
sadly i agree with you. initially i was really excited about S4TF because swift is a fantastic language and it would be a fantastic replacement for python as the defacto ML language. but then i realized tensorflow is much worse than pytorch and the S4TF team was too small to build anything substantive enough to win market share.
S4TF was started over 2 years ago, and hasn't really gained any ground since then. It's a project on life support and Lattner's departure means the project is going to be put down soon.
It's good to try different things; S4TF tried, and failed. No one really cares about it at Google except the team developing it. The researchers are adopting other new technologies, like JAX.
Unfortunately Swift is a dud.... and the Apple ecosystem is stuck with it for the next few decades....
I have said it before: Swift is to Objective-C what Scala is to Java. Sure, there are plenty of people that like Scala, but its 'academic stuffiness and complexity' doomed it to being a niche language.
Same with Swift. It is doomed to be an Apple-ecosystem-only kind of language.
All I wanted was a Python* look-alike with some solid static typing. What we got was a franken/monster/language where people felt free to try out their little academic pet peeves, sucking the fun out of programming with it and making it less accessible to beginners.
Go is becoming popular not because it is shoved down people's throats, but on its own merit, and mainly because they kept it simple. It is the closest to a "static Python with some minimal features" we got...
*I think Python is a great language, and very accessible for beginners, just not suitable for large projects due to its dynamic type system
I write iOS apps for a living. Most of the people I know have moved away from ObjectiveC. When I see language boards and jobs for iOS, I see Swift asked for over ObjectiveC. On Reddit, Swift has more subscribers than ObjectiveC, most of the tech articles I see these days are ObjectiveC, and when I watch a video on WWDC I often see Swift first.
Swift may not be what you wanted, but it is a long way from being a dud. Swift didn't have to be great, it just had to be better than ObjectiveC.
Because of some technical argument, or just personal aesthetics?
>I have said, Swift is to Objective-C, what Scala is to Java. Sure, there are plenty of people that like Scala, but its 'academic stuffiness and complexity' doomed it to a niche language.
While Objective-C was really cool, it was not modern enough, and few people liked it (mostly old NeXT/early OS X guys, but not most of the iOS crowd).
And Swift is easy to use and nothing like Scala in academic-ness and complexity.
>All I wanted is a Python look alike, with some solid static typing*
That's not what the ecosystem needed, or what people in general want (there's Go for that, for one). Swift is somewhere between Rust and Kotlin, features wise.
I'm not sure this argument makes sense. I'm not primarily a Swift developer. My go-to languages are Python, C++, Java and Javascript (I build DevOps automation pipelines and monitoring tools -- primarily for scientific computing). But my experience with Swift on macOS and iOS has been pleasant. I definitely would not use it for cross-platform development. But that is not because the language isn't nice. It's because it won't have widespread support outside the macOS ecosystem. But for UI apps on macOS and iOS, it is the best choice. Similarly, I use C# for Windows UI apps, but would not use it for anything else. Use the right tool for the job.
Have you checked Nim? It's the closest thing to static Python that I know of (plus some extras like Lisp inspired macros), and there is some early development of a machine learning ecosystem (outside of wrappers):
When I read that SiFive is working on custom silicon, my first thought was to wonder what would happen if custom hardware and custom programming languages co-evolved together, instead of languages adapting to old hardware that's adapted to older programming languages.
.. and here he is talking about compilers. I might have to keep an eye on SiFive in addition to Oxide.
There are some other people talking about him not staying long at places. In this talk he mentions how he intended to stay at UIUC for one year and got 'nerd-sniped' into staying for 5 years building LLVM. After an experience like that, I could see how someone might feel claustrophobic and tend to take any opportunity on offer - if it's interesting enough.
I wonder if this will curtail the effort to implement TensorFlow in Swift, turtles all the way down?
That would be a shame. Python ecosystem with TensorFlow, PyTorch, mxnet, etc. has been good for rapid progress but I think we need something better to break out of just using deep learning. This needs a hackable infrastructure. I personally don't have the skill to hack the C++ TensorFlow core.
I think a new ecosystem based on Swift, TensorFlow, and future tools and platforms makes some good sense.
An alternative would be a similar hackable infrastructure based around the Julia language, which is also very good.
Even in swift it isn't "turtles all the way down".
The AD stuff is hardcoded into the C++ guts of the compiler, whereas Julia's source-to-source autodiff accesses a compiler pass from a fully Julia user package.
Aside from making it easier to hack and improve the AD system as just a Julia user, this capability enables other package program transforms like that in https://github.com/MikeInnes/Poirot.jl for prob programming.
So Julia is already further ahead in that regard and it's more hackable.
re: "So Julia is already further ahead in that regard and it's more hackable."
I agree. Flux is very concise, very nice to work with. I just had some trouble with my small playing-around code snippets when going from one minor release to the next, but that probably means I should revert to the LTS 1.* version.
I have tried Julia with non-mathematical stuff like using it with sqlite, fetching and using RDF data, and general text processing - nice for those use cases also.
He went from Tesla to Google in no time flat, less than 3 months. But he was at Google from August 2017 until now, which is significantly more than one ISO no-time-flat unit.
I was super tempted to buy the new learning development board they just released (which is actually tough to get in the US), but I haven't been able to figure out the benefits of the new SiFive processors compared to other traditional ARM boards like an M0. Anyone here familiar with their boards who can provide some insight?
Why should a random tinkerer trying to learn or get a basic job done with a microcontroller/CPU care whether the architecture is open source? It's still overwhelmingly difficult to do anything interesting with the fact that the architecture is open source.
There are a couple of nice innovations happening in RISC-V. A lot of nice FPGA stuff where you can extend the architecture yourself, and tooling that supports that.
There are some cool RISC-V chips out that have different advantages: AI accelerators, lower power and so on. Lots of security stuff as well.
I mean sure, all those things exist in the ARM world to some extent or another, but there is definitely cool stuff being built.
I didn't say anything about ARM specifically either. If I were a random tinkerer, I would choose based on these factors:
* Ecosystem support (libraries, stack overflow, docs)
* Quality of tools (compilers, debuggers)
* Price/availability
ARM is probably a good choice based on these factors, due to things like Raspberry Pi and Arduino (not all ARM, but maybe the more popular ones are?). An open-source hardware architecture doesn't really factor in anywhere here.
Then to be honest it's a bit difficult to explain this to you in one post. Assuming you are even willing to consider any perspective other than your own egoistical current short-term position, you should read up on how this industry operates, on politics in it, on how processors and architectures are contracted and licensed, on what lobbying and political tools big manufacturers like Intel use in order to get their advantage, on the history of the different architectures in general, on the cybersecurity aspects of open/closed silicon, etc. etc.
In short, yes RISC-V is a huge deal and if you care anything about software, hardware or computing in general you should be very happy and optimistic because of its existence. The team behind it is doing a tremendous monumental job of creating a free architecture, and they are still in the beginning of it.
Yeah, I've been following SiFive for a while now, especially since they announced a tool for custom "ASICs", and I know they're a huge driver for RISC-V. I just haven't seen their MCUs used much and am curious what advantage they have over other competitors.
Their value proposition is basically that you can get to market with a custom chip cheaper than with anybody else. They argue that their chips are lower energy and use less space. They are more configurable, and the RISC-V tooling is built around this as well.
They do seem to have quite a few customers, but their current growth is VC-funded.
There is nice stuff coming down the pipe as well: the RISC-V Vector extension and hopefully finally some Linux boards. A RISC-V SoC with FPGA is going to be a product. But hardware product cycles are just long and it takes a while.
Both were probably designed the same way - company name is "word five" so take the first letter and fit it into a 5 sided polygon and bam you've made each logo.
SiFive's actually fits this design more even though it came out later. They don't need the added shape at the bottom, and the embedded character can be seen as either "s" or "5", while the Carbon5 one needed the extra bit at the bottom and its "5" is a bit harder to see.
I don't think that's nearly as close to Carbon5's logo as the SiFive logo is, but fair point, there's always overlap in these... When I posted that I actually couldn't see the difference between their logos and was genuinely a bit confused about whether they were connected, but I see a small difference at the bottom of the pentagon now... so it seems unlikely there's an intentional connection, but I was genuinely confused at first!
MLIR is getting used internally more and more inside TensorFlow, but also by separate teams on different projects, like IREE for example (https://github.com/google/iree ). The TensorFlow Lite converter has been replaced by the new MLIR-based one, and similarly for the Edge TPU.
The LLVM Fortran frontend (f18/flang), which will merge soon into the LLVM monorepo, is using MLIR for its own IR. It'll be exciting to develop a non-ML MLIR-based frontend within LLVM! In particular Flang is opening an HPC perspective that could be leveraged by other DSLs later.
They are adding an OpenMP dialect to MLIR right now: https://llvm.discourse.group/t/rfc-openmp-dialect-in-mlir/39...
Intel has been actively porting their nGraph/PlaidML framework to be based on MLIR (search for "The Stripe dialect" and "nGraph Dialect" here: https://mlir.llvm.org/talks/ ).
Edge TPU will eventually be canceled as well, I'm pretty sure, which is why, as exciting as it is otherwise, I'm not using it for anything practical. There's just no way to make $1B/yr with it, so it's officially below Google's executive interest threshold.
By far the #1 committer and founding engineer of Swift for Tensorflow also recently left the project: http://rxwei.me/about/ . From the committer list, it looks like Dan Zheng is the only individual who is actively working on it, and he was an intern until he formally started in January 2019. I don't know the ins and outs of Google culture, but it looks at least like every senior engineer has moved on to something else.
There are medium sized teams at Google for each of MLIR and s4tf. There's a weekly open design meeting for each of them. Many different folks have presented their work at both meetings.
Not clear with MLIR, since it seems to have some traction beyond S4TF.
But yes, I think S4TF has had zero interest among people actually doing ML and is probably dead. It might take Google a while to actually kill it off, though.
From conversations I’ve had with people doing ML, if they know about S4TF at all they actively dislike certain aspects (for example that it’s based on a statically typed language), or are dismissive of others (like support for differentiation, which is a neat parlor trick but saves a tiny bit of effort).
Not "zero" IMO. I, for one, was excited about its potential, I was just waiting for it to mature (wisely in retrospect). Python is really bursting at the seams for ML/DL at this point and the ecosystem is in need of a proper compiled language for implementing deep learning systems. This language must be easier on the newcomers than C++.
Imagine a future where you could tell if your shit's broken by recompiling it, and your "deployment" would be just putting some RPC in front of your existing code.
Most people I know that understand s4tf well (including me) are excited about it. Chris Lattner and I co-taught two video lessons on it - have a look at those and see what you think: https://course.fast.ai/videos/?lesson=13 .
Hey Jeremy, I'm a really big fan. I got really interested in Swift based on your initial blogpost on it, and managed to get the company I work at to invest a bit in researching how suitable it could be for real time ML on video.
If it's not too forward, do you know at the moment if FastAI will continue to invest in Swift? I know development was a bit stalled due to the work needed for FastAI 2.0, but I was wondering if it'll resume, or if Richard Wei and Chris Lattner leaving changes anything?
Cheers and thanks for FastAI!
PD: Was working a bit on SwiftCV, added the videoio and highgui modules. Was wondering if I should PR it to fastai's or vvmnnnkv's version of the repo?
I would be delighted. I've even spoken to the Julia core team about it. Hopefully one day they'll make that happen! (Just like we developed SwiftAI thanks to support from Chris Lattner and the s4tf team).
Probably easiest is to use Colab, which supports Swift directly. Otherwise, use one of the approaches shown on the course site: https://course.fast.ai/
(Most data scientists on Windows, including me, use SSH to connect to a GPU server running Linux for training models.)
Windows needs to die though. The sooner it dies, the sooner we can all just assume that everything has bash and a bunch of excellent compilers installed.
C++ isn't that bad if you already know it. I agree. I enjoy programming in C++ myself. For anyone just starting out, it's pretty horrifying, which is why there aren't a lot of experienced C++ programmers around. Its worst trait is that it encourages excessive cleverness, and left unchecked that can very quickly make any code impossible to understand.
Also Portuguese universities seem to be out of sync with the rest of the world if they teach C++. In the US most people with a CS degree haven't been exposed to C++. It's either Java or something functional. C++ typically doesn't even come up. Which is pretty puzzling to me, because most of the programs I use daily (including this very browser) are C++ programs.
I was also pretty excited when I heard that they were baking autodiff into the language but unfortunately Swift has very little traction outside of mobile development, which makes it a hard sell.
I think a compiled subset of python is the best way to make that happen. All of the JITs (Numba, PyTorch, Jax, etc) already handle a decent portion of the language, so it should be doable.
Last I checked, Swift is still years away from 1st class Windows support and adequate tooling. Without those in place, Swift+Tensorflow is still a very niche product for Google and I cannot think of a good enough reason why they should heavily invest into it.
Wild speculation, but my guess is that both will continue to be developed.
As far as I understand, the MLIR project predates its machine learning application and was originally intended as a new IR for clang. In that capacity it makes a lot of sense. MLIR is also currently experimental in Tensorflow, although I have no idea how mature the implementation is.
Similarly, there has been significant investment into Swift for Tensorflow, so it's probably here to stay. On the other hand, from a language design perspective Swift is not a particularly good choice for automatic differentiation and translation into Tensorflow graphs (imperative, exposing many details of the underlying machine, etc.). Without a lot of investment into this project it might just be overtaken by a better engineered competitor, or more likely, fail to gain sufficient mind-share over the "good enough" python solution that already exists.
Some of the hyphens here are fine and some aren't.
One legitimate use of hyphens is to break a word at the end of a line so that part of the word is on one line and part is on the next line. (This was used a lot in the days of physical newspapers and magazines, which used pretty skinny columns but didn't want to leave the ragged whitespace that wrapping the full word would cause.)
In this case, it looks like the text was wrapped like that, hyphens were inserted, but then the text was rewrapped (without hyphen breaking) to a different width, but the hyphens from before were still left behind.
E.g., you can see a line of text like this: "Similarly, the RISC-V architecture pro-vides unique opportunities for SoC customization at every level. This is only possible with SiFive’s ambi-tious design methodology, which is unmatched in the industry."
where "pro-vides" and "ambi-tious" are inappropriately hyphenated.
The text probably looked like this at some point:
Similarly, the RISC-V architecture pro-
vides unique opportunities for SoC
customization at every level. This is
only possible with SiFive’s ambi-
tious design methodology, which is
unmatched in the industry.
With soft hyphens, the browser won’t display any glyph unless it breaks the word at that point. Alternatively, you don’t need to insert soft hyphens at all if you’re fine relying on the browser’s hyphenation dictionary for your language, and the CSS word-break property is set. The downside there is that sometimes the browser will get really aggressive and terminate nearly every line with a broken word, reducing legibility.
I'm not sure whether GP refers to joining phrases up (like "end-to-end" or "idea-to-silicon" or "one-stop-shop") or using it instead of a comma ("I also spearheaded the creation of Swift - a programming language that powers Apple’s ecosystem - and led a team at Tesla that applies a wide range of tech in the autonomous driving space"). The former is just a matter of taste (but IMO has a teensy flavour of "marketing" about it), the latter is totally fine too.
I think GP is referring to stuff like "pro-vides" and "program-ming" which look like they were hyphenated to line-wrap, but depending on the width of your browser window probably don't actually split across lines
To dive down into the pedantic with you all, I think in the latter case the hyphen is incorrect. Normally in that place is an en(–) or em(—) dash. Hyphens are used for joining or breaking words. IIRC
The GP is talking about how the text was written with hyphens inserted where words crossed a line break, but then the text was reflowed so the lines no longer break at the same spot. For example, in the original, it looked something like this:
I have a hard time connecting "machine learning" with what RISC-V is about.
Above all, the original plan for RISC-V was to make a barebone MCU ISA first, and everything else second.
This was largely due to ARM being very militant with terms on RTL access for M* series cores for commercial use.
If you throw enough extensions and workarounds on top of even an 8051, you should be able to make a CPU-grade core with it. But being able to do it doesn't mean you should.
RISC-V was originally developed because some vector-processor / ML people at Berkeley needed an extensible control processor for their specialized hardware.
They'd previously been using ancient 32 bit MIPS, but they needed 64 bit, a good amount of spare opcode space for custom instructions, and reasonable licensing. Nothing suitable existed, so they rolled their own.
RISC-V with the almost-done Vector extension is likely to be a big force in ML hardware.
Whatever number of ALUs / DSP slices you can put in an FPGA and soft-wire together, you can put just as many hard-wired in a custom SoC with lower area and cost, and faster performance.
An FPGA is good for prototyping this until you figure out the best arrangement, sure, but three months later you can have real chips.
> Tensor Processing Units (TPUs) are Google’s custom-developed application-specific integrated circuits (ASICs) used to accelerate machine learning workloads.
> Project Brainwave is a deep learning platform for real-time AI inference in the cloud and on the edge. A soft Neural Processing Unit (NPU), based on a high-performance field-programmable gate array (FPGA)
The guy who designed the TPU works on RISC-V and is a fan of RISC-V. And one of the reasons was his experience with the TPU: he realized that custom hardware is really great, but a more generally programmable approach would have advantages.
That's exactly what the RISC-V Vector extension was designed for. When they started the TPU, RISC-V wasn't ready for something like that and the Vector extension was at best an idea.
Saying 'RISC-V' is bad for ML because product XY doesn't use it is a terrible argument in general. Over the next 10 years we will have literally 1000s of different things that make AI fast. All of those come with tradeoffs.
RISC-V as a base chip for many systems is clearly a great fit, even on FPGAs. The RISC-V Vector extension is an excellent fit for many AI problems, and many companies are currently working on that.
> Above all, the original plan for RISC-V was to make a barebone MCU ISA first, and everything else second.
This is simply not correct. RISC-V was designed to allow minimal cores but by design it also tried to cover everything from minimal to HPC.
> This was largely to ARM being very militant with terms on RTL access for M* series cores for commercial use.
Again this is not correct. Are you just making this stuff up? The reason they developed RISC-V is because of complexity and license issues that would not allow them tape out say x86 or ARM.
> If you throw enough extension, and workarounds even on top of 8051, you should be able to make a CPU grade core with it. But you being able to do it, doesn't mean you should.
Why not? Is your instruction going to run slower because its spec is written down in a different PDF document? Are you mixing up ISA and micro-architecture?
If you take the whole RV64GCVBH you have a full-featured core that is designed to allow super-high-performance implementation. And the ISA design is easily better than any of the other ISAs.
100 MIPS and 16 MB of RAM from 256 Z80s in 1981. You can get the same from a single RISC-V soft core in a cheap FPGA board today. More, because a 32 bit RISC MIP is worth several times more than an 8 bit Z80 MIP.
Observational comedy is intended to give the viewer this: without it having a very real basis in reality, the joke wouldn't have been funny (at least, not in the same way).