Since we recorded this Lite Cone episode with Scale AI CEO Alexander Wang, Meta has agreed to invest over $14 billion in scale, valuing the company at $29 billion. Alex has also announced he will lead Meta's new AI super intelligence lab. Our conversation you're about to hear covers the history leading up to this investment. From scale's early days at YC to its integral role in the training of foundational models. Let's get to it. The AI industry really continues to suffer from a lack of uh very hard evals and very hard tests that show really like the frontier of model capabilities. The biggest thing is you just have to really really really care. When you interview people or when you interact with people, you can tell people who are just sort of like phone it in versus people who sort of like hang on to their work. It's like the so incredibly monumental and forceful and important to them that they they do great work. Very exciting time to to see the how the frontier of human knowledge expands. [Music] Welcome to another episode of the Light Cone. Today we have a real treat. It's Alexander Wang of Scale AI. Jared, you worked with uh Alexander way back in the beginning actually. Uh what was that like? What year was it? Put us in the spot. Yeah, Alex. I mean, most of what we want to talk about today is like what Scale is doing now because like the the current stuff is like so so awesome and so interesting since Scale got started at YC. I thought it just seemed appropriate to start all the way at the start. And um you it's funny uh Diane and I were at MIT last month talking to college students and like of all the founders, the one that they like most look up to and like want to emulate is actually you. Like everybody wants to be the next Alexander Wankers. Everybody knows the story of how you like dropped out of MIT and and ended up starting scale, but they don't know the real story. And so I thought it'd be cool to go back to the beginning and just talk about the real story of how you ended up dropping out of MIT and starting scale. So before I went to MIT, I worked at um Quora for a year. And so this is 2015 to 2016 or no sorry 2014 to 2015 was when I worked as a software engineer and this was already at a point in the market where ML engineers as they were called or like machine learning engineers uh made more than software engineers. So that was already like the market state at that point. I went to these summer camps um that were that were organized by um by rationalists the rationality community in San Francisco. So um and they were for precocious teams but they were organized by um uh many people who have become pivotal in the AI industry. So one of the organizers is this guy Paul Cristiano who um used to uh who's the inventor of RHF actually and now he run or he's a research director at the US AI safety institute. He was at opening for a long time. Um Greg Brockman came and gave a speech at one point. Eleazar Yudkowski came and gave a speech at one point and actually I was very like when I was I don't know must have been uh 16 I was exposed to this concept that like potentially the most important thing to work on in my lifetime was AI and AI safety. So something I was exposed to very early very early on. So then when I went to MIT I was started MIT when I was 18. I like studied AI quite deeply. That was most of what I did in the sort of day job. and then um uh kind of got antsy applied to YC and then the idea was kind of like okay how could initially was like okay where can you apply um sort of like AI to things and this was um in the era of chat bots which is like crazy to think about actually um that there was like this like mini chatbot bubble boom yeah yeah yeah 100% in uh in 2016 um which is uh which was I guess spurred by magic right? Or or some of these apps and and uh Facebook had a big vision around chat bots. And anyway, there's this little mini chatbot boom. So, the initial thing that we wanted to work on uh and um was was chat bots for doctors, right? Which is like a funny idea because do you guys know anything about doctors? Yeah. No, not at all. um like basically no I it was just sort of like oh doctors are a thing that sound expensive and so and I think it was like I think it's like indicative of like I mean I don't know you guys see this all the time but I feel like most of the times young founders like first 10 ideas are like always first of all they're very mimemetic so they're probably like there's a lot of like the same ideas over there's like a dating app there's like some something for like you know social life you know there's the same ideas um and then I think that like I think young people have a very poor sense sense of alpha like what are what are the things that they're actually like going to be uniquely positioned to do and I think you know most young people don't have a sense of self so it's you know it's not clear so when we were in YC we were roommates with uh with um with another YC uh company and we were sort of like um we were sort of observing this like this like chatbot boom ahead of you know that was happening at the time um and but it was very clear that like um chat bots if you wanted to build them, and this is funny to say in retrospect, required lots of data um and required lots of like human elbow grease um to be able to get them to work effectively. And so like just like kind of off the cuff at one point was like, oh, like what if you just did that? What if you just did the data and the the like language data and the the human data so to speak for the chatbot companies? We were also very lost, by the way. I think you probably remember we we were we were quite lost mid batch. um uh and like many YC companies I think and so then we um switched to this like concept I think the you know the initial idea was like API for um for human tasks or something along those lines and uh and one night I was just like trolling around for domains scaleappi.com was available and then we just bought it we launched it I think a week later we product hunt yeah I remember the the product hunt page is still live I was reading it last night and I remembered the tagline line. It was an API for human labor. Like that that that that's my recollection of sort of like the like distilled insight that you had was like what if there is an a what if you could call a human with like an API? Yeah. And that was I mean I think it was like 3 days for us to put up the landing page. It launched on product hunt. I think this idea captured some amount of imagination of the like of the startup community at the time because it was sort of like this weird form of futurism where you have like humans delegated like APIs delegate to humans in this in this interesting way. It's like an inversion of the Yes. Yeah. Humans doing work for the machines instead of the other way around. Yeah. Yeah. Yeah. It's funny because the the initial phase, you know, we sort of we just worked with all these engineers who reached out to us from um from that product hunt which was a real grab of use cases, but then that was enough for us to raise money at the time and like you know uh and to get going and then a few months after that uh we it became clear that like self-driving cars was actually the first major application that we needed to focus on. And so there were many uh very big decisions I would say in the first like year or so of the company. One thing that was curious is at that point there were other solutions that were already the game in town like mechanical turk from Amazon was sort of the thing that people were using but you ended up capturing this whole other set of people that didn't know about it and you had a way better API and you kind of won.
Yeah. It was not clear at that point because you probably were compared a lot with mechanical turd. Yeah. So, Mechanical Turk was definitely the sort of like um the concept in most people's mind at the time. I mean, it was just it was kind of one of these things where I think a lot of people had heard about it, but anyone who had used it knew it was just awful. And so, it's like whenever you're in a space and that's kind of the like that's like the thing. It's like people mention a thing, but it sucks, that's usually like a pretty good sign. Um, and so that was that was enough to give us like early confidence. But then I think the thing that like really I would say the the um the thing that was as actually fundamental to the success of the the company was actually focusing on this like on this like seemingly very narrow problem of of self-driving cars. I think that um you know I remember very early on when it was maybe like six months after we were out of YC basically um there was another YC company Cruz that that had reached out to us on our website and sort of like in the blink of an eye they became our largest customer and they found you just from your launch or yeah just yeah I think maybe even Google like I it's not even totally obvious but just yeah vaguely from our launch and vaguely it was actually an XYC founder that uh was working at Cruise that reached out to us so maybe some YC mumbo jumbo. We're a ketti. Yeah. Uh, who knows? The world works in mysterious ways, but and so they grew very very large. So then early on we made this decision and I remember we we we um went to our lead investor at the time and you know we had this conversation. It's like hey actually we think we should probably just focus on this self-driving thing. You know it was actually a very interesting conversation because the reaction was like oh that's just like obviously way too small a market. um then like you you know you're never going to build like a gigantic business that way. Um and we were like we think it's probably a much bigger market than than you think it is because there's like you know all these self-driving companies are getting crazy amounts of funding and the automotive companies are doing huge programs in self-driving and it clearly is the future. Like it feels like something that that um that should exist and so we're like if we focus on it we think we can build like build the business much more quickly. And it's funny looking back because both things are true. It is both true that it enabled us to build the business to be to get to scale pretty very quickly and it is also true that that was not a big enough market to sustain a gigantic business. The story of scale in many ways is like this progression of like how do you continue you know AI is this incredibly dynamic space. Um, lots of things are constantly changing and um, a lot of I think what um, what we pride ourselves on at the company is how we've been able to um, continue building on and and um, contributing to this very fastmoving industry. When did you uh, become much more aware of the scaling laws because you know uh, one of the interesting facts that sort of emerged is that uh, you're a little bit the Jensen Hang of data. I think that in self-driving um scaling laws were not really a thing. Um because and the fundamental the the biggest reason actually was that like one of the biggest problems in self-driving is that your whole algorithm needs to run on the car and so you're very constrained by the amount of compute you have access to and is available to you. So like a lot of the engineers and a lot of the companies working on self-driving never really thought about scaling laws. They were just all thinking about like, okay, how do you keep grinding these algorithms to be better and better better that are like small enough to fit onto these um onto these cars? But then we started working with OpenAI in 2019. This was like GPD2 era. Um and I would say like GPT1 GPT was sort of like this curiosity. GPD2. Um, I remember OpenAI like they would have a booth at these like large AI conferences and they would like, you know, their demo would be to allow researchers to like talk to GPD2 and it was like mildly like it was it wasn't like particularly impressive, but it was like kind of cool. is like kind of this thing and then um I think by GPG3 uh it was sort of this like that's when the scaling loss clearly um you know felt very real and that was I mean I think GPD3 was 2020 um so which is actually like long before before the world caught on to what was happening did did you know as early as 2020 did did you have a strong inkling that this was really going to be like the next big chapter of scale or not until chat GPT took off was was 35 or was it four? I think that like um in 2020 I think it was clear that scaling laws were going to be a big thing, but it was still not totally obvious. you I remember this like interaction, you know, I I got early access to GPD3 and then it was like in the playground and then I I was like playing with it with a friend of mine and uh I I told the friend of mine, "Oh, you can like talk to this model." And during the conversation, um uh my friend got like visibly frustrated and angry at the AI, but in a way that wasn't just like, "Oh, this is a dumb like toy." It was like it was in a way that was like somewhat personal. And that's when I was I realized like whoa, this is like somehow qualitatively different from anything that had existed before. I feel like it was passing the touring test at that point. Kind of. It was like semblances. Yeah, semblance. It was like sort of like the the the glimpses of it potentially passing the touring test, right? But I think the thing that really um caused the recognition of I would say generative AI, which is still even the term in some ways, it was really Dolly I think that that um that convinced um that convinced everyone. But I think I think my my personal um journey was like GBD3 like was like highly interesting and then and so it was like one of many bets at the company and then in 2022 over the course of Dolly and and then um and then later chatbt and you know um GP4 etc. and we worked with open eye on instruct GBT which is kind of the precursor to chat GBT. it became very obvious that that was like the at the farm moment for the for the company and for frankly the world. That's when we saw it as well with the big shift in companies because it was that 3.5 moment release end of uh 2022 and we started seeing a bunch of companies and smart people changing directions and pivoting their companies in 2023 and that was that moment this dynamic that you referenced which is kind of the you know scales the NVIDIA for data kind of thing. Um I think that became quite obvious. Um I would say GPD4 really was the moment where it was like it was like wow this is like like scaling laws are very real. The need for data will basically you know grow to consume you know all available information and and um and knowledge on that humans have. And so um it was like wow this is this is like this like astronomically large opportunity. Yeah for seemed like the first time it was something that you could uh get to not hallucinate basically ever you could actually have a zero uh hallucination experience in limited domains and which is we're still sort of in that regime even at this point. You know, the classic view is that if it's hallucinating, you're not giving it the correct data in the prompt or context or uh you're trying to do too much in one step. Yeah. I mean, I think I think like the reasoning paradigm is is is has a lot of lags and it's actually been interesting this last era of the of model improvement because um uh the gains are not really coming from pre-training um which is so so we're like moving on to a new scaling curve of of reasoning and reinforcement learning but it's it's like shockingly effective um and and I think that you know it's the the analogies between like AI and and Mors law are pretty clear which is like you know you'll get on different like technical curves but like if you zoom way out it'll just be feel like this like smooth improvement of models.
One of the things that uh has been popping up with some of the like really big well-known rappers is they're getting access to full parameter finetunes of the base models especially the frontier base closed source models. Is that like a big part of your business or you know something that people are sort of coming to you for just like these verticalized full parameter fine-tuned like data sets? Yeah, I think this is going to be a like blueprint for the future, right? So right now I mean like the total number of large scale parameter fine tuned or reinforcement fine-tuned models is like still pretty small but if you kind of think about it like that like one version of the future is that every firm's core IP is actually their specialized model or their own fine-tuned model and just in the same way that like you know today you would generally think that the co the the uh IP of most tech companies is their codebase um in the future you would generally think that their their their specialized IP might be the model that powers all of their all their internal um workflows.
And what are the special things they can add on top? Well, they can add on um data and environments that are somehow specific very very specific to the day-to-day problems or information or challenges or business problems that they see um on a day-to-day level. And that's the kind of like really gritty real world information that you know nobody else will have because nobody else is like doing the same the exact same business motion as them. There's a lot of weird tension in that though. Um I remember uh friends of ours from one of the top model companies came by and they were like hey do you think YC and YC companies would give us their evals so we could train against it? And we were like no dude what are you talking about? Why why would they do that? Because that's like their moat. And then I guess now that based on this conversation, it's actually I mean eval are pretty important as a part of RL cycles. And then even the eval are not really uh the valuable part. The valuable part is actually the like properly fine-tuned model for your data set and your set of you know sort of problems. Yeah, it's like these Lego blocks, right? If you have the data and you have the environments and then you have the you have you know a base model, you like you know can stack those on top of each other get get a fine tuned model and obviously the eval are important. This is some of the tension and this is basically you know in a nutshell the sort of like um does AGI become a Borg that just sort of like swallows the whole economy in like you know has one firm or do you still have a specialized economy? My belief generally speaking is that you you still do have a specialized economy like the like these models are platforms but the like like alpha in the modern world will be determined by you know to what degree you're able to sort of like encapsulate your business problems into data sets or environments that are then conducive towards building like you know differentiated models or differentiated AI capabilities. Yeah, that's why asking for eval was so crazy to me because it's like okay you get the evals the base model is way better and then not you know now all your competitors have exactly uh the same thing that used to be your advantage. I think we will undergo a process in AI where we learn what the bright lines are, right? I mean, I think that like it's like very obvious and intuitive to tech companies that they should not give away their codebase and they should not give away their database. Like they should not give away their data, they should not give away their codebase. The analoges of that in a you know highly AI fueled economy I think will identify over time but are yeah the evals your data your environments etc. I think you have a very uh techno optimistic view of what the future is going to be with how jobs are going to be shaped. Can you talk more about that? Because I think you hinted at it before. It's going to be more specialized. It's not that all these jobs are going to go away, right? First off, it's undeniably true that we're we're uh at the beginning of an era of like a new a new way of working like like you know this there's this term that people have used a long time which is like the future of work. Well uh we are like entering the future of work or the certainly the next era and so work fundamentally will change but I do think um humans own the future and we we are we are like uh we have a lot of agency actually and a lot of a lot of choice in how this sort of like reformatting of of work or how the reformatting of sort of like workflows ends up playing out. You know, I think you kind of see this play out in uh in coding right now. And I think coding in some ways is is really the sort of like um case study for other fields and and other you know other areas of work where sort of the the initial phase is this sort of like assistant style thing where um you know you're kind of doing your work and then the models are kind of like assisting you a little bit here and there and then you go to a you know the sort of like cursor agent mode kind of thing where you're you're like um synchronously asking the the models to like carry out these workflows and you're sort of like you're you're managing like one agent kind of or you're sort of like you're kind of like pair programming with a single agent and then and then now with like codecs or other systems like it's it's very clear the paradigm is like oh you have this like you have this like swarm of agents that you're going to deploy on like all these various tasks and you're just going to like sort of like you know dep like um give all these tasks and you'll have this sort of like um this this cohort of of agents that are sort of like you know doing this work that you you think is appropriate. And that last job um uh has a has a semantic meaning in the in the current workforce. It's a manager. You know, you're basically managing this sort of like this set of agents to do um actual work. And so and and I think that like AGI or you know AGI or doomers or whatnot like they take this view that like oh even this job of like managing the agents will just be done by the agents. So like humans will be taken out of the of the process entirely. But our belief, my personal belief is that you know this is um management is very complicated. Um management is also like more about like what's the vision that you have and what's the sort of like what's the like end result you're aiming towards and those will be fundamentally I think like you know we have a human demand and human desired driven economy.








