Lapsus$ hackers leak 37GB of Microsoft's alleged source code

petercooper · on March 22, 2022

Because I had a similar thought when Twitch's code "leaked" and I never got round to asking: is it illegal under US law to download such a leak for private inspection? That is, illegal in a way beyond copyright violation, such as being party to an act of espionage.

(This leak doesn't interest me at all, but if ever the source for QBasic leaked, say.. well I might like to read it without becoming a wanted fugitive ;-))

leeter · on March 22, 2022

IANAL, the risk is generally not in the downloading per-se but the distribution and use. The copyright owner can sue for use or distribution of the IP. So you might be able to download and look but that would then basically prohibit you from making your own compatible QBasic interpreter. You're better off clean rooming it generally speaking. So it's basically poisoned, you can look but as soon as you do you can't touch anything to do with that area again until you can plausibly say you forgot everything you saw.

phendrenad2 · on March 22, 2022

I would add that if you make a competing product to Microsoft, and they find out that you looked at their source code, they might use that as justification to open up a lawsuit to check if you copied their code at all. Or if they open a lawsuit to see if you copied code, the fact that you looked might be used against you. IANAL so I could be wrong...

Anunayj · on March 22, 2022

So this technically makes piracy legal, as losing as I'm not distributing the material, (or seeding it's torrent).

kmeisthax · on March 22, 2022

Not necessarily. The reason why torrent seeding is so perilous is because P2P is a privacy nightmare, not because they forgot to say the word "download" in the law. If you want to sue people who are copying your work, BitTorrent makes it very easy to get enough information to demand dox from an ISP. For a "mere downloading" case to even occur, you first need to compromise (legally, of course) the host of some direct-download site, subpoena their logs, subpoena a bunch of ISPs, get dox from that, and then sue that class of users. Usually at that point the copying has stopped anyway, which is enough to get a copyright owner to back off.

If such a suit were to happen, the argument would probably boil down to where the infringement actually occurred. Does the server infringe when it sends copyrighted material (because that's where the copy is made), or does the client infringe when it requests said material (because they asked for an infringing copy)? Courts might accept both arguments and just decide everyone is liable.

Also all of that is for copyright, which (usually) covers published works. Trade secrets, which cover things not ever intended to be published, would just call both sides guilty of "misappropriating" the trade secret.

jeroenhd · on March 22, 2022

IANAL. If you pirate without distribution (so torrents are out of the question) and don't circumvent any DRM mechanisms (cracks, keygens) then I don't think there's a legal basis for a lawsuit.

However, most games and software come with some form of DRM which you need to bypass to pirate them, and that's often banned explicitly in copyright legislation. It is under the American DMCA laws and, as far as I know, under European copyright laws, but your mileage may vary.

But yes, as far as I can tell, if someone shares their Good Old Games setup files with you and you download them, you're not breaking the law (though the person sharing the content obviously is).

Regardless of actual legality, you can still expect a lawsuit if your company pirates software and defending against that usually costs more than actually buying the software.

addaon · on March 22, 2022

Downloading a file which contains copyrighted data is creating a copy (you now have that sequence of bits in memory or on disk on a device you control) without the right having been granted, no?

jeroenhd · on March 23, 2022

According to the laws I know of, distribution is prohibited but receiving an illegal copy isn't. The server is offering an illegal copy and that is the main problem in most piracy cases. Usually the person who offers the copy has more resources and acts like a central part in the redistribution, so they're the easiest and most profitable targets of lawsuits. I'm sure that was considered when the laws were written.

But again, I don't know the specifics of copyright law where you live. Perhaps your jurisdiction considers the person accepting the offer of an illegal copy to be a criminal as well.

jcranmer · on March 22, 2022

The person running the server is the one who is creating the copy and doing the distribution, not you the user.

addaon · on March 22, 2022

Even if you argue that the initial copy of the work in RAM in your network stack was created by the server operator, you're likely to make copies from there: copies in RAM if your network stack is not zero copy; copies to disk if these pages swap; and copies to disk if you save the files.

See e.g. https://scholarship.law.edu/cgi/viewcontent.cgi?article=1602...

jl6 · on March 22, 2022

Dubious, when the user is the one initiating the copy (by requesting the download) and the one benefiting from the copy.

vmception · on March 22, 2022

Copyright infringement requires sharing the copy without a license to do so, not making the copy

addaon · on March 22, 2022

The right to make copies (reproduce a work) and distribute copies (sharing) are both separately protected by copyright law [1].

[1] https://www.bitlaw.com/copyright/scope.html

vmception · on March 22, 2022

thanks! people go after uploaders, who both created a copy and distributed the copy, the downloader is receiving yet another copy and by nature of the technology is "reproducing" simply by how filesystems work but it seems like the uploader is the center of attention

thoughts on that? It seems that enforcing this against a downloader would require every single piece of media have an explicit limited license for downloading, which isn't practical right now

leeter · on March 22, 2022

IANAL, but no. You can still get sued.

razakel · on March 22, 2022

You can be sued for anything. I could sue you for farting in an elevator.

You have to actually show damages, and nobody is going to care about personal research into obsolete software.

retrocryptid · on March 22, 2022

except oracle.

monkeynotes · on March 22, 2022

This is the case in Canada, at least.

Biganon · on March 22, 2022

Switzerland, too.

kmeisthax · on March 22, 2022

The repo would be a trade secret and downloading it could potentially be construed as misappropriation of such. However, I know of no case in which someone decided to sue every individual who downloaded a leak out of curiosity. Usually you'd only bother for large businesses that could actually gain a competitive advantage from that data.

Of course, the flip side of that is that anyone who has ever touched the leak is persona non grata in that industry. So if you're interested in learning how QBasic works, and you ever want to touch anything that interprets scripting code, don't touch those leaks.

FWIW disassembling QBasic (instead of obtaining leaked source) would be an absolute defense against a trade secret claim, but in terms of copyright you now have "access", and need to avoid "substantial similarity" in any source code you write. You aren't strictly-speaking "tainted" (clean-room is not a legal requirement), but if someone actually sued you for copying QBasic you'd better have a good legal argument for why every line of your code does not infringe upon the code you disassembled.

petercooper · on March 22, 2022

but if someone actually sued you for copying QBasic

Not to take away from your useful points, but that would be quite the case in 2022! :-)

op00to · on March 22, 2022

I would not download a code leak for a few reasons.

First, I want to be able to contribute code to open source projects, and I feel like seeing some "leaked" code could somehow taint me in a way that makes this more tricky for me.

Second, my employer expects me to act in a manner that reflects positively upon them. I don't think it's fair business dealings to read stolen code from someone who might be a competitor.

johannes1234321 · on March 22, 2022

> but if ever the source for QBasic leaked

QBasic? - Why that modern thing. Just use the proper BASIC: https://github.com/microsoft/GW-BASIC ;)

petercooper · on March 22, 2022

Haha, yes. When that came out I asked the guy at MS if QBasic was on the cards next and I seem to recall he said something about it being problematic and not super likely. I spent 1000x longer with QBasic though (QB4.5 and PDS 7.1 technically.)

sixothree · on March 22, 2022

If you want anything clean room related in your career, I wouldn't read any of it.

hughrr · on March 22, 2022

If it was like the mainsoft leak, I learned a hell of a lot about what not to do reading that.

captaincaveman · on March 22, 2022

I don't want their compiled code let alone their source code!

hutrdvnj · on March 22, 2022

I could imagine a LibreWindows where all the bloatware, advertising, microsoft backdoors and so on are completely removed from the source.

_vbnz · on March 22, 2022

Russia is funding ReactOS.

zozbot234 · on March 22, 2022

ReactOS is not based on Microsoft source code. It's a clean-room reimplementation and anyone who has ever looked at Microsoft code is permanently banned from contributing to that project.

hutrdvnj · on March 22, 2022

How do they know that their contributors never took a look at Microsoft source code?

razakel · on March 22, 2022

This was a major factor in the Oracle v. Google case, where some of the argument boiled down to "well, how else would you do that?".

ReactOS audited their code and showed that none of it was Microsoft's.

NullPrefix · on March 22, 2022

Honor based system

lostgame · on March 22, 2022

With what money? :P

jbverschoor · on March 22, 2022

Microsoft has been a supporter of open source lately anyway.

ushakov · on March 22, 2022

no sane org would touch this with a 10 foot pole

vluft · on March 22, 2022

all you have to do is run it through an "AI" and then what comes out is definitely not a derivative work and you can legally use it however you like. Microsoft is very confident in this; just see Copilot.

Uke · on March 22, 2022

And it would even be open-source copyleft [0] because the "AI" can't claim it's copyright either! Check mate patent trolls

[0] https://www.theverge.com/2022/2/21/22944335/us-copyright-off...

2Gkashmiri · on March 22, 2022

yeah, if they somehow manage to remove the original data, would copyleft be able to "remember" the idea of the code so that if we try to recreate it, would we be able to without modifying the copilot to prevent this?

justinclift · on March 22, 2022

It'd be interesting to see (and likely public interest) to have some good analysis done of Microsofts telemetry. eg anything untoward in there, apart from just the forced telemetry itself

Spivak · on March 22, 2022

But with some clean room due diligence this could be a boon to WINE.

mschuster91 · on March 22, 2022

There have been a lot of leaks over the years. W2K and NT4 got leaked in 2004, XP in 2020 [1], and that's just the publicly known leaks where stuff ended on the 'chan boards or torrent sites. It's more than likely that there have been more leaks (e.g. from one of the academia or government audit programs) that have never been widely dispersed.

And yet, both WINE and ReactOS have refused to use the leaks; ReactOS doesn't even allow people who have worked legitimately at MS in the past to be developers, simply because even the smell of contamination would expose the projects to enormous legal risks.

The only way I could imagine these leaks being useful is by "parallel construction" aka by comparing the source code with actual Windows binaries and then the WINE/ReactOS code to spot out differences to check, and then have a second person investigate the differences with only the note "check function XYZ with implementation in current Windows binary". But that's a lot of effort for very low reward, not to mention you'd need at least two very skilled experts and the low-hanging fruits having been picked already long ago.

[1] https://borncity.com/win/2020/10/01/entwickler-compiliert-wi...

[2] https://reactos.org/forum/viewtopic.php?t=20189

proxysna · on March 22, 2022

and does not matter if they like it or not /s

tailspin2019 · on March 22, 2022

If any of the hackers are here, could you run this on the Windows repo please?

  git reset --hard windows2000-sp4

/s

_Algernon_ · on March 22, 2022

Make it win 7 and I'd agree.

(Actually I don't care anymore. Modern linux beats any version of windows in all my use cases)

ASalazarMX · on March 22, 2022

> Modern linux beats any version of windows in all my use cases

Are you aware that hackers have been leaking the source code of Linux for years?

throw10920 · on March 22, 2022

You can find so many modded versions though...

ComradePhil · on March 22, 2022

> Modern linux beats any version of windows in all my use cases

It's the other way around for me. Windows Subsystem for Linux ftw.

I have used Linux as the primary Desktop OS for most of my life but I hope I will never have to go back ever again.

retrocryptid · on March 22, 2022

i use mDNS too frequently to find WSL useful. and was kind of surprised to find MS EDGE woukldn't connect to a node service run unf i WSL. but.... if you don't need networking, it's pretty decent. you can even get an x server and run XTerm and get VT320 emulation.

ComradePhil · on March 22, 2022

What you say about networking is not true. Networking works seamlessly between native windows applications and linux applications running on WSL.

alunchbox · on March 22, 2022

Make it vista and we're golden

belltaco · on March 22, 2022

Not sure if Vista was the low point or Windows ME. Probably Vista.

_Algernon_ · on March 22, 2022

Wrong verb tense there. The low point is windows 11.

sigg3 · on March 22, 2022

95 and 98 were horrible. Hearing the error chime still gives me physical reactions.

fomine3 · on March 23, 2022

Agree. UAC is pretty basic feature.

bezospen15 · on March 22, 2022

Lol ok

fps_doug · on March 22, 2022

I remember back in the day, I preferred SP2. It seemed to run notably faster on slower hardware and there were very few instances of software (I don't even remember any) that required a later one to run on 2k.

Just think about that. There was a time where you'd not use an older version of an OS, but an older patch level of an OS and it didn't feel particularly wrong.

mschuster91 · on March 22, 2022

> There was a time where you'd not use an older version of an OS, but an older patch level of an OS and it didn't feel particularly wrong.

Unless you had a network of these, like a school, and a couple enterprising pranksters with Metasploit.

apexalpha · on March 22, 2022

Granted, from now on all your electronic devices will run Windows 2000 IA-64.

no_time · on March 22, 2022

>all your electronic devices

Cool. Having the only Itanium powered oven/fridge/washer gives him insane nerd street cred.

jcranmer · on March 22, 2022

I can understand the oven, but surely being "Itanium-powered" would be counterproductive for a fridge?

giantrobot · on March 22, 2022

Your fridge yes, your oven no.

___8___ · on March 22, 2022

Are you sure you don't just want vms instead?

retrocryptid · on March 22, 2022

i miss VMS. it's sort of like WinNT but with all the stuff that doesn't work stripped out.

hestefisk · on March 22, 2022

windowsnt4-sp6a

ancode · on March 22, 2022

Bing and Cortana, the two most important and best working Microsoft products.

Seirdy · on March 22, 2022

Bing powers a host of other engines. Yahoo, DuckDuckGo, most of Qwant, AOL, the web results from Windows Search, Ecosia, and others are all dependent on Bing's search APIs.

If you use a Google alternative, there's a good chance it's just Bing under the hood. So this leak could be a pretty big deal.

dustinmoris · on March 22, 2022

There is a reason why they hide that code ^^

q1w2 · on March 22, 2022

My guess is that they did not leak Windows and Office source code because they want to mine it for 0days themselves.

Time to really lock things down folks.

denysvitali · on March 22, 2022

Or simply because (as the screenshot show) they got only access to Azure for an account that doesn't have access to the OS department of Microsoft

phendrenad2 · on March 22, 2022

It's too bad it's always existing companies that get their source leaked. Just once I'd like to see "Archive of Digital Equipment Corporation source code leaked online". I can dream...

tablespoon · on March 22, 2022

> It's too bad it's always existing companies that get their source leaked. Just once I'd like to see "Archive of Digital Equipment Corporation source code leaked online". I can dream...

The problem is those companies are defunct, so their source repositories may not even exist anymore, let alone be online, e.g.:

http://www.chrisfenton.com/homebrew-cray-1a/

> After searching the internet exhaustively, I contacted the Computer History Musuem and they didn’t have any either. They also informed me that apparently SGI destroyed Cray’s old software archives before spinning them off again in the late 90’s.

I know at my employer, there's always pressure to half-ass things that aren't directly connected to some mechanism for making money. We recently migrated our company intranet site from one vendor to another, and the team that was running that project it as a "feature" that they would help us "clean up" by not assisting us migrating anything older than one year. Similarly, source control migrations (of which we've done several) often drop history, since it's usually way easier just to download the latest version and check it into the new system than figure out how to migrate the metadata. IIRC, Microsoft's TFS-VC to git migration tool will only migrate something like 180 days of history.

skilled · on March 22, 2022

Sooo, are all of these leaks directly tied to the Okta access they had? That’s an absolute bummer for business if so. Sheesh.

ushakov · on March 22, 2022

no, the leaks are separate

gurjeet · on March 22, 2022

Oh no, what have I done!

I hope you're not referring to my comment here [1]. Please note, that was, and remains, purely a speculation/hypothesis; see the thread after that. I have no knowledge, firsthand or otherwise, of them exploiting people's build systems.

[1]: https://news.ycombinator.com/item?id=30763269

wiz21c · on March 22, 2022

> I have no knowledge, firsthand or otherwise

Sus !

PLG88 · on March 22, 2022

It saddens me that Microsoft cannot properly implement zero trust principles or account based access control to their DevOps environment. VDI and VPNs are not secure, no networks are secure!

hdjjhhvvhga · on March 22, 2022

First, I wouldn't be so harsh on them: statistically, the probability of successful attacks increases with the size of the company, and having lived long enough I consider it a miracle that the likes of Apple and Microsoft had seen so few leaks.

Second, zero trust is a very specific concept that basically refers to not trusting networks traditionally considered as more secure, such as corporate LANs. It is definitely not a panacea, not to mention that no large company, including Google, is able to implement it in full without incurring enormous costs.

Third, whatever you do, it's extremely difficult to protect against an inside job. I'm not suggesting it was a case at all, I'm just saying it's better not to jump to conclusions too hastily.

PLG88 · on March 22, 2022

Agreed, with a couple of caveats:

- based on the perenial patch Tuesday issues I am surprised it did not happen sooner.

- zero trust is a journey. we should accept that networks cannot be secure and instead look to implement principles of ZT away from the network. I like the open source OpenZiti project as a way to put strong identity and zt principles into our apps. Its not a panacea but it does make access and exploit much harder.

- correct, though if using attribute-based access controls we can at least massively limit what an insider could get access to... 37GB of source code across multiple different project at first blush looks like more than what 1 single user should have access to.

akdor1154 · on March 22, 2022

Or maybe even Microsoft has the odd one or two guys who like to check in their nuget packages dir.

baxtr · on March 22, 2022

> statistically, the probability of successful attacks increases with the size of the company

I work in the cyber insurance industry. This is not true.

Closi · on March 22, 2022

> I work in the cyber insurance industry. This is not true.

Really? I mean our small company has never had our codebase breached and released by hackers, while Microsoft and their subsidiaries have had this happen several times. Similarly Twitch, Github, Valve... all have suffered source code breaches similar to the article.

None of the small companies I have ever worked for have had this issue, so it does seem that large tech companies have a higher probability of having their codebase successfully leaked.

We are also talking about Microsoft, which is probably amongst the top companies that are targeted the most by hackers across the world (mainly because of the impact when they are breached, rather than the ease of breach).

I assume when OP talks about the likelihood of a successful breach, they don't mean the success % of a breach, they mean the total number of successful breaches. When I worked at a big company the security team seemed to be putting out small fires all the time with targeted phishing attacks and so many laptops that could have missed an update, virtual machines getting ransomware e.t.c., and now I work for a smaller company and look after their IT as part of my role we have only had 1-2 fairly small issues across the last year.

kreativ_py · on March 22, 2022

These are mostly social engineering attacks. As employees headcount increases, so does attack vectors.

caaqil · on March 22, 2022

> zero trust

https://twitter.com/cyb3rops/status/1506169666020028416

pwdisswordfish9 · on March 22, 2022

So you're saying Microsoft should have put their source code on a blockchain?

cenal · on March 22, 2022

I thought they tried that already with Windows Vista. I mean... it seemed to at least share a lot with Blockchain since the initial release had more variations than anyone cared to track, was slow, confusing, expensive, and hardly anyone used it despite all of the hype....

anothernewdude · on March 22, 2022

I think you mean their DevOops environment.

nonrandomstring · on March 22, 2022

> alleged source code

I may just be line noise, but compiles and runs just as well.

marginalia_nu · on March 22, 2022

So, perl?

nonrandomstring · on March 22, 2022

fave Perl dis; "an explosion happened at the ASCII factory"

petepete · on March 22, 2022

So 37GB of source code is clearly a lot, but for someone unenlightened what kind of size are we looking at for the bigger projects? For example, Windows, Office, Exchange.

thepill · on March 22, 2022

https://devblogs.microsoft.com/bharry/the-largest-git-repo-o...

realmod · on March 22, 2022

Interesting. Presumably, LAPSUS had access to Windows source code but still decided to go after comparatively low-fruit like Bing and Cortana instead of the digital gold that is Windows.

Cthulhu_ · on March 22, 2022

Wasn't Windows' source code leaked a number of times already?

I think Bing and Cortana will have some "algorithms" that might be worth a lot more for the right buyer. I mean Google's search algorithm is one of the best kept secrets in the industry.

criddell · on March 22, 2022

Windows source isn't all that hard to see. I know they've made it available to some universities and large customers also can get access.

https://www.microsoft.com/en-us/sharedsource/enterprise-sour...

traceroute66 · on March 22, 2022

> I know they've made it available to some universities and large customers also can get access.

And, IIRC, infamously the Chinese government too, because they made it a pre-condition of them purchasing Microsoft licenses that they must have source code visibility.

karteum · on March 22, 2022

"Infamously" ?

Well, there is for sure a lot to criticize about the CN government, but this precondition seems to me very natural (the OS is a very natural place for possible backdoors, otherwise...)

ComradePhil · on March 22, 2022

No government should buy closed source anything for obvious reasons.

bjarneh · on March 22, 2022

> because they made it a pre-condition of them purchasing Microsoft licenses that they must have source code visibility.

Even the communist party in China is more up to date then my own government

weeblewobble · on March 22, 2022

10,000 enterprise windows licenses minimum... that's a big customer

rwmj · on March 22, 2022

Windows source code is fairly widely available, as in government agencies, universities and others already have access. I'm sure this means anyone motivated enough could get it if they really wanted to. Of course even looking at it is problematic if you want to work on open source operating systems later, so I'm not sure why you'd voluntarily choose to do this.

fluoridation · on March 22, 2022

It can be useful even if you're not directly incorporating it into your code. For example you might want stronger guarantees than the API documentation offers (e.g. "this function will only ever return values between this and that in this particular version of Windows"), and being able to read the code to check if your assumptions are valid is very useful. I've worked in function hooking before and ReactOS has been a very useful resource on occasion.

easton · on March 22, 2022

When public documentation for the Hyper-V APIs sucks the way that it does, I'd be willing to risk not being able to write a operating system later if I could figure out a side project now ;)

bpye · on March 22, 2022

Full disclosure, I work on Hyper-V. Are you thinking of these docs - https://docs.microsoft.com/en-us/virtualization/api/hypervis... - or something else?

easton · on March 23, 2022

I'm thinking of the HCS docs (https://docs.microsoft.com/en-us/virtualization/api/hcs/over...). There's very poor documentation of the different types of VMs/Containers you can launch and how to launch them, I'm not sure how much of this is intentional or due to the newer container APIs being too new, but it's super frustrating when you're trying to understand how WSL2 or Windows Sandbox work (or honestly, how to use Windows containers without Docker).

thematrixturtle · on March 22, 2022

I think you misspelled "dumpster fire". Microsoft is known for going to extreme lengths to maintain backward compatibility, but for Windows in particular this means code that's been hacked on for decades.

kwccoin · on March 22, 2022

Whilst it might be treasure for hacker, but for "learner" could I say any Windows are horrible precisely due to this baggage.

_betty_ · on March 23, 2022

MS have multiple devops orgs separated by location i believe. Judging by the screenshot where it says MSASG, I believe that's the China/Asia one.

It does have other orgs in the screenshot, but all the leaks seem to be ASG related.

traveler01 · on March 22, 2022

Windows source code must be huuuuge...

hughrr · on March 22, 2022

Well it depends if it's a snapshot or clone or the repositories themselves. It could be a whole load of nothing that changes a lot.

bravetraveler · on March 23, 2022

It's crazy the leaking group is offering people who work at likely targets money.

Like, what's the exit strategy there? They use your credentials, leak stuff, and you take the fall?

cute_boi · on March 22, 2022

Anything interesting in this leak?

hestefisk · on March 22, 2022

Does anyone actually use Cortana and Bing?

dogleash · on March 22, 2022

Bing's pretty decent when you want to search the web for your query without Google attempting know what you're looking for better than you do.

Tenoke · on March 22, 2022

A lot of people use DDG which mainly uses Bing.

hestefisk · on March 22, 2022

True.

snvzz · on March 22, 2022

It would be fun if a backdoor was found in there.

retrocryptid · on March 22, 2022

cool. we can fix it now.

dirtylowprofile · on March 22, 2022

Anything badass...