undefined | Better HN

0 pointsdh20222y ago0 comments

You are assuming Microsoft code base is superior to Linux / Git / MySql / whatever else is in github right now. That is a .... big assumption.

And if Microsoft's code ends up influencing the rest of the world code that would be a .... big downside.

0 comments

Karellen2y ago

I don't think you should be looking at the best of the Microsoft/GitHub corpora to gauge their overall quality. You probably want to be looking at the quality of the median project, which is going to be heavily influenced by the long tail of low quality projects.

IMO, the long tail of non-code-reviewed, written-by-someone-in-their-first-month-of-coding, barely-even-compiles noob code[0] in Github is going to be orders of magnitude larger than the long tail of crap in Microsoft's internal repos.

[0] Hey, everyone has to start somewhere. There's nothing wrong with your first "hello world" program being buggy - that's what being a beginner means. But it's probably not the sort of code you want to train an LLM on.

Wowfunhappy2y ago

Now I'm wondering if the copilot AI (GPT3/4?) takes number of stars/forks/etc into account during the training process.

singleshot_2y ago

I would think if the LLM knew it was rookie code, it could actually be pretty useful, no?

giantg22y ago

Without a myriad of dumbasses like me being able to commit to Microsoft vs Github, I'd assume Microsoft's average is better than Github's.

blackhole2y ago

That is a... bold assumption to make. Not just for Microsoft but for any large corporation.

lelanthran2y ago

> That is a... bold assumption to make. Not just for Microsoft but for any large corporation.

I dunno; the average project on github isn't code-reviewed, while all the projects at Microsoft are.

giantg22y ago

I'm not saying bad code doesn't exist there. My thought is that the percent of bad code increases with volume (or at least higher number of producers). Tens of millions of people committing to Github should mean its more cluttered with garbage than in MS. I at least assume MS has some automated code standard or security scans. That's at least more than nothing.

1 more reply

dietr1ch2y ago

Yup, individuals struggling to have impact will cut corners and heavily impact tech debt :P

zik2y ago

I have a friend who worked at Microsoft... if his opinion is anything to go by that's very far from true.

eru2y ago

> You are assuming Microsoft code base is superior to Linux / Git / MySql / whatever else is in github right now.

How do you get that impression from the comment? I don't see anything implying that.

ryanwaggoner2y ago

Not for Microsoft it wouldn’t, which was their point.

j / k navigate · click thread line to collapse

0 comments

Karellen2y ago

Wowfunhappy2y ago

Now I'm wondering if the copilot AI (GPT3/4?) takes number of stars/forks/etc into account during the training process.

singleshot_2y ago

I would think if the LLM knew it was rookie code, it could actually be pretty useful, no?

giantg22y ago

Without a myriad of dumbasses like me being able to commit to Microsoft vs Github, I'd assume Microsoft's average is better than Github's.

blackhole2y ago

That is a... bold assumption to make. Not just for Microsoft but for any large corporation.

lelanthran2y ago

> That is a... bold assumption to make. Not just for Microsoft but for any large corporation.

I dunno; the average project on github isn't code-reviewed, while all the projects at Microsoft are.

giantg22y ago

1 more reply

dietr1ch2y ago

Yup, individuals struggling to have impact will cut corners and heavily impact tech debt :P

zik2y ago

I have a friend who worked at Microsoft... if his opinion is anything to go by that's very far from true.

eru2y ago

> You are assuming Microsoft code base is superior to Linux / Git / MySql / whatever else is in github right now.

How do you get that impression from the comment? I don't see anything implying that.

ryanwaggoner2y ago

Not for Microsoft it wouldn’t, which was their point.

j / k navigate · click thread line to collapse