|
|
Subscribe / Log in / New account

Developer statistics for 2.6.30

By Jonathan Corbet
May 27, 2009
As the 2.6.30 development cycle heads toward a close, it is natural to look back at what has been merged and where it came from. So here is LWN's traditional look at who wrote the code which went into the mainline this time around.

Once again, 2.6.30 was a large development cycle; it saw the incorporation (through just after 2.6.30-rc7) of 11,733 non-merge changesets from 1125 developers. The number of changesets exceeds 2.6.29, but the number of developers falls just short of the 1166 seen last time around. Those developers added 1.14 million lines of code this time around, while taking out 513,000, for a net growth of 624,000 lines.

The individual developer statistics for 2.6.30 look like:

Most active 2.6.30 developers
By changesets
Ingo Molnar3242.8%
Bill Pemberton2271.9%
Stephen Hemminger2041.7%
Hans Verkuil1991.7%
Takashi Iwai1881.6%
Bartlomiej Zolnierkiewicz1861.6%
Steven Rostedt1791.5%
Greg Kroah-Hartman1501.3%
Jeremy Fitzhardinge1251.1%
Mark Brown1070.9%
Jaswinder Singh Rajput1050.9%
Rusty Russell1000.9%
Tejun Heo980.8%
Johannes Berg980.8%
Hannes Eder880.8%
Michal Simek850.7%
Luis R. Rodriguez850.7%
Sujith850.7%
David Howells800.7%
Yinghai Lu780.7%
By changed lines
Greg Kroah-Hartman1203539.0%
ADDI-DATA GmbH434203.3%
Mithlesh Thukral424243.2%
Alex Deucher265762.0%
David Schleef259051.9%
David Woodhouse246361.8%
Ramkrishna Vepa234951.8%
Lior Dotan225061.7%
Eric Moore222661.7%
Eilon Greenstein183991.4%
Jaswinder Singh Rajput181681.4%
Hans Verkuil180481.4%
David Howells179411.3%
Andy Grover163551.2%
Michal Simek158271.2%
Sri Deevi155141.2%
Frank Mori Hess154501.2%
Ben Hutchings150311.1%
Ingo Molnar138761.0%
Bill Pemberton138171.0%

On the changesets side, Ingo Molnar is at the top of the list this time around; as usual, he created a vast number of patches - about five per day - in the x86 architecture code, ftrace, and beyond. Bill Pemberton is perhaps better known as the maintainer of the Elm mail client; he did a lot of cleanup work with the COMEDI drivers in the -staging tree. The bulk of Stephen Hemminger's work involved converting network drivers to the new net_device_ops API. Hans Verkuil continues to improve the Video4Linux2 framework and associated drivers, and Takashi Iwai continues to generate a lot of patches as the ALSA maintainer.

Linus kicked off the 2.6.30 development cycle by noting that about one third of the changes in 2.6.30-rc1 were "crap." So, unsurprisingly, the top three entries in the "by changed lines" column all got there through the addition of -staging drivers. Alex Deucher added Radeon R6xx/R7xx support; many of his "changed lines" were associated microcode firmware. And David Schleef added another set of drivers to the -staging tree.

Contributions to 2.6.30 could be traced back to some 190 employers. Looking at the most-active employer information, we see:

Most active 2.6.30 employers
By changesets
(None)197016.8%
Red Hat130511.1%
(Unknown)118410.1%
Intel8557.3%
Novell8327.1%
IBM6305.4%
(Consultant)2932.5%
Atheros Communications2622.2%
Oracle2522.1%
University of Virginia2271.9%
Fujitsu2171.8%
Vyatta2041.7%
Renesas Technology1521.3%
NTT1211.0%
MontaVista1151.0%
HP1070.9%
Wolfson Microelectronics1050.9%
(Academia)1020.9%
Nokia980.8%
XenSource910.8%
By lines changed
(Unknown)18141313.6%
Novell16422912.3%
(None)1180958.9%
Intel860606.5%
Red Hat739545.5%
LinSysSoft Technologies647984.9%
ADDI-DATA GmbH434203.3%
SofaWare392452.9%
Broadcom319562.4%
AMD283642.1%
Entropy Wave259051.9%
IBM257021.9%
Oracle255881.9%
NTT252351.9%
Neterion234951.8%
LSI Logic223041.7%
Atheros Communications216271.6%
(Consultant)192091.4%
Freescale161391.2%
PetaLogix158461.2%

These numbers are somewhat similar to those seen in previous development cycles. There are a few unfamiliar companies here; they are pretty much all present as a result of contributions to -staging. It is interesting to note that Atheros and Broadcom, once known as uncooperative companies, are increasing their contributions over time.

Your editor has not looked at signoff statistics for the last few cycles. The interesting thing to be found in Signed-off-by tags is an indication of who the gatekeepers to the kernel are. Especially if one disregards signoffs by the author of each patch, what remains is (mostly) the signoffs of subsystem maintainers who approved the patches for merging. For 2.6.30, these numbers look like this:

Top non-author signoffs in 2.6.30
Individuals
David S. Miller121612.1%
John W. Linville8658.6%
Ingo Molnar8368.3%
Greg Kroah-Hartman7977.9%
Mauro Carvalho Chehab7847.8%
Andrew Morton6606.6%
James Bottomley2502.5%
Linus Torvalds2192.2%
Len Brown1891.9%
Takashi Iwai1651.6%
Jeff Kirsher1451.4%
Russell King1271.3%
H. Peter Anvin1201.2%
Mark Brown1151.1%
Jesse Barnes1111.1%
Benjamin Herrenschmidt1111.1%
Reinette Chatre1041.0%
Martin Schwidefsky950.9%
Avi Kivity910.9%
Paul Mundt890.9%
Employers
Red Hat426442.4%
Novell138613.8%
Intel9519.5%
Google6606.6%
(None)4084.1%
IBM3783.8%
Linux Foundation2192.2%
(Consultant)1661.6%
(Unknown)1271.3%
Wolfson Microelectronics1151.1%
Renesas Technology920.9%
Marvell910.9%
Atomide810.8%
Oracle800.8%
Astaro650.6%
Freescale630.6%
Cisco610.6%
Analog Devices600.6%
Univ. of Michigan CITI590.6%
Panasas580.6%

Signoffs have always been more concentrated than contributions in general. Still, one wonders how David Miller manages to approve a solid twenty patches every day. On the employer side, things are more concentrated than ever; over half of the patches going into the kernel go through the hands of a developer at Red Hat or Novell. Developers, it seems, work for a great many companies, but subsystem maintainers gravitate toward a small handful of firms.

All told, the picture remains one of a well-oiled, fast-moving development process. We also see a picture of a -staging tree which is growing at a tremendous rate; your editor is tempted to exclude -staging patches from future reports if the rate does not slow somewhat. Even without -staging, though, a lot of work is being done on the kernel, with the participation of a large group of developers, and it doesn't look like it will be slowing down anytime soon.

Postscript: Jan Engelhardt sent your editor a pointer to a short script which, through use of the git blame command, tallies up the "ownership" of every line in the kernel. The top results for 2.6.30-rc7 look like this:

Who last touched kernel code lines
LinesPctWho
406372335.17% Linus Torvalds
4640214.02% Greg Kroah-Hartman
942000.82% David Howells
860310.74% David S. Miller
826080.71% Luis R. Rodriguez
722000.62% Bryan Wu
701280.61% Takashi Iwai
668590.58% Ralf Baechle
557850.48% Hans Verkuil
540690.47% Paul Mundt
540070.47% Kumar Gala
532880.46% David Brownell
516400.45% Russell King
506110.44% Paul Mackerras
494990.43% Andrew Victor
493470.43% Mauro Carvalho Chehab
492560.43% Alan Cox
473050.41% Mikael Starvik
470400.41% Ben Dooks
443070.38% Benjamin Herrenschmidt

Linus shows a high ownership because he was the initial committer at the beginning of the git era. To a rough approximation, one can conclude that approximately one third of the code in the kernel has not been touched since that time. There are other interesting things which can be done with line-level statistics; your editor plans to explore this idea some in the future.

Index entries for this article
KernelReleases/2.6.30


to post comments

Only 1/3rd!

Posted May 28, 2009 9:00 UTC (Thu) by alex (subscriber, #1355) [Link] (3 responses)

That's an interesting number. I wonder if the "blame" for Linus (or possibly
more correctly the initial commit) can be trended to show when the kernel
will have been fully re-written PG (Post GIT).

Only 1/3rd!

Posted May 28, 2009 13:40 UTC (Thu) by nix (subscriber, #2304) [Link] (2 responses)

It could only do that reliably if he wasn't writing any code at all anymore.

Only 1/3rd!

Posted May 28, 2009 16:29 UTC (Thu) by incase (guest, #37115) [Link] (1 responses)

That's one reason I usually do initial commits with a distinct user id on any repository.
That makes it easy to see which code was in the initial commit (and would therefor possibly need to be checked for ownership in an older repository).

The cleaner solution of course is to get the hostoric information from the old repository also into the new one, but that doesn't always work (as, probably, during the BitKeeper->git move).

Only 1/3rd!

Posted Jun 27, 2009 12:54 UTC (Sat) by jengelh (guest, #33263) [Link]

There is no need to do initial commits with a separate user id. I just need to adjust the scriptÂ…

Total lines: 11934488
  #1     4246304  35.58%  *initial checkin
  #2      230061   1.93%  Greg Kroah-Hartman
  #5       85543   0.72%  David S. Miller
#178       10621   0.09%  Linus Torvalds
#229        8424   0.07%  Andrew Morton

Developer statistics for 2.6.30

Posted May 28, 2009 16:30 UTC (Thu) by iabervon (subscriber, #722) [Link]

It might be interesting to look at the non-author sign-offs by person as a percentage of changes, rather than as a percentage of non-author sign-offs. It's not as interesting to know how much of the total gatekeeping is done by someone as to know how much of the total change is through them. In this case, 10.4% of the changesets got David Miller's sign-off in transit.

Canonical?

Posted May 29, 2009 18:55 UTC (Fri) by mikov (guest, #33179) [Link] (18 responses)

I hope I can ask this without sounding trollish - is Canonical present at all in these statistics? Do they do any kernel work?

Please note that I am not criticizing, just asking. Everybody is free to use whatever business model they can with Linux.

I have to admit though that I am a little concerned that if Canonical is taking customers away from RedHat and Novell (both major kernel contributors), it could ultimately hurt the kernel.

Canonical?

Posted May 29, 2009 19:43 UTC (Fri) by dlang (guest, #313) [Link] (7 responses)

they don't do much kernel development, they focus on the userspace stuff.

remember that opensource isn't a zero-sum game. yes ubuntu takes some customers away from RedHat, but they also get a lot of people using linux who would not have used RedHat.

some small percentage of these additional people that they get involved with linux will become kernel developers someday, so even with no direct payments to kernel developers they still benifit the kernel

Canonical?

Posted May 30, 2009 1:12 UTC (Sat) by mikov (guest, #33179) [Link] (5 responses)

Thanks for the reply. I generally agree with what you are saying - more Linux adoption is a good thing, and of course there is neither a law, nor an ethical requirement, saying that everybody has to contribute to the kernel.

However there is a secondary observation that has been bugging me for some time: in light of these statistics, it seems less likely that Canonical would be able to provide high quality kernel support with their support licenses.

Canonical?

Posted May 30, 2009 10:06 UTC (Sat) by kragil (guest, #34373) [Link] (3 responses)

Canonical employs a few very capable kernel hackers. Sure they don't contribute as much as the RH or Novell guys, but maybe that it because their time is spent providing high quality kernel support. Think about it ;)

Canonical?

Posted May 30, 2009 22:19 UTC (Sat) by mikov (guest, #33179) [Link] (2 responses)

If they provide kernel support, wouldn't it result in patches, which then would have to visible in the above statistics?

(BTW, I want to reiterate that I have nothing against Canonical, and in fact at one time our company was considering purchasing support. Alas, instead there was a company-wide downgrade from Kubuntu to Windows XP for all non-developers. That however is a different subject...)

Canonical?

Posted Jun 1, 2009 5:50 UTC (Mon) by rahulsundaram (subscriber, #21946) [Link] (1 responses)

Support can either mean hand holding (ie) do foo to accomplish bar or it can be mean prioritized bug fixes or new features and if it is the latter and the vendor is pushing those fixes upstream, then it would show up in these statistics.

Canonical?

Posted Jun 10, 2009 19:25 UTC (Wed) by jengelh (guest, #33263) [Link]

Well handholding seems scarce if I have to look at some Launchpad projects, so there is not much left.

Canonical?

Posted Jun 10, 2009 7:18 UTC (Wed) by SimonO (guest, #56318) [Link]

Perhaps it doesn't matter too much either way? Kernel contributors tend to be individuals rather than companies, so if one company would have to lay off people, other linux companies who have more success can hire them.

It would be worrying if the company which is more successful would not allow upstream contributions.

/Simon

Canonical?

Posted Jun 2, 2009 12:55 UTC (Tue) by nix (subscriber, #2304) [Link]

Even if it was a zero-sum game, the kernel works quite well and gets
vastly more development attention than any other component of the system.
Perhaps it would be a good thing if other components (e.g. X) got lots of
new developers :)

Canonical?

Posted Jun 1, 2009 23:10 UTC (Mon) by martinfick (subscriber, #4455) [Link] (5 responses)

Do you have any evidence that they are taking away RH or Novell customers? I think it's more likely that all 3 are mostly growing new customers.

Canonical?

Posted Jun 2, 2009 18:24 UTC (Tue) by mikov (guest, #33179) [Link] (4 responses)

Yeah, I have lots of "evidence"; that is what I do all day - gather evidence to implicate Canonical in trying kill Linux :-)

Seriously, I think these are valid questions to ask and discuss if we do it politely. The consensus between those who responded is that Canonical is benefiting the Linux ecosystem and I agree.

I do however think think that there are two different alternate realities here. One is the overenthusiastic reality which you see at tech sites and discussion forums (e.g. Slashdot), and the other is the actual physical reality.

For example In the physical reality I have never met a person who actually uses Ubuntu. When I say "uses", I don't mean install every new version and "try" it, but actually use it for everything on his home/work PC every day. I am sure such people exist, but they are much fewer in absolute numbers than one would assume by reading tech sites. (I myself actually _do use_ Ubuntu on my laptop 100% of the time, but people think I am crazy. We also tried to use it on desktops at work, but alas that failed).

On the other hand, I have met many people who have a Ubuntu Live CD, or a partition, which they boot probably once a month. This is a fake user base. Their primary OS with very few exceptions is Windows. These are probably the people who upgrade their Ubuntu every 6 months - I am sure that nobody who actually uses their computer would be crazy enough to do an upgrade so frequently.

So, I think the answers are more complicated.

Canonical?

Posted Jun 2, 2009 21:33 UTC (Tue) by dlang (guest, #313) [Link]

I know a couple dozen people who use ubuntu as their distro, all day long.

I think it is going to depend a lot on what groups you are with as far as how many people use what distro.

I used slakware for 10+ years so I'm not a non-technical user, and I decided to switch to ubuntu.

Canonical?

Posted Jun 4, 2009 7:29 UTC (Thu) by Lennie (subscriber, #49641) [Link]

Well, it all depends how you look at it, if webstatistics are anything to go by, Linux has something like 1% of the desktop. That means 1 in a 100 is a Linux-desktop. Have you seen what a 100 desktops are running in the past months ? I know I haven't. Actually, possible not even in years (ok, if I don't count the LUG-meeting I visited). I also know their are a lot of people who are like minded and they use Linux 'together'. So these Linux desktops also are clustered together. That means it's even less likely you'll see a Linux desktop in the wild. I use it every day and I know 5 people who use it atleast a few times a week.

Canonical?

Posted Jun 5, 2009 2:22 UTC (Fri) by Duncan (guest, #6647) [Link]

> On the other hand, I have met many people
> who have a Ubuntu Live CD, or a partition,
> which they boot probably once a month. This
> is a fake user base. Their primary OS with
> very few exceptions is Windows.

Agreed in general that these aren't true users, and that there are
probably a lot of Ubuntu LiveCDs and installs in that group. However,
it's important to note that the *ix way of computing is a decently large
change in computing mindset, and that some portion of these will
ultimately become full-time Linux users.

(FWIW, I was in this segment for 2-3 years, around the turn of the
century, with Mandrake the distribution I was playing with. Then MS
decided they were going a different way than I was, and basically gave me
that last push I needed to make the jump, when they went the eXPrivacy and
remote authorization route. This for a user who had previously considered
most MS software "too important" to risk warezing, and who had previously
been spending probably 50% of his computing dollars on MS directly. But
after the push, I soon discovered how liberating the land of freedomware
was, and now look back at proprietaryware much like a defector looking
back at his former home -- I have a lot of friends and family I left
behind and will do what I can to help them make the jump as well, but
it's nowhere I want to be or can even visit, unless the regime changes and
becomes free as well. I no longer even consider proprietaryware, nor
could I without serious legal issues, as I can no longer agree to all
the !#@! demanded of proprietaryware users, whose masters, those supplying
the proprietaryware, really /do/ seem to think of them as slaves, not
actual human beings, with few if any rights worth considering.)

> These are probably the people who upgrade
> their Ubuntu every 6 months - I am sure
> that nobody who actually uses their computer
> would be crazy enough to do an upgrade so
> frequently.

Why not? I actually use my computer, running Gentoo, and upgrade on
average 2-3 times a week. In fact, I hate going a full week without an
upgrade as the changes start getting too large to easily cope with all at
once. Thus, I'd argue that six month upgrades, far from being too
frequent, are WAYYY too infrequent, by three orders of magnitude!

FWIW, I found the same issue on Mandrake and quickly switched to Cooker,
where the rolling updates were MUCH easier to cope with and most of the
big issues the release version upgraders had to cope with were incremental
changes I had dealt with as a matter of course tiny incremental bits at a
time, long months before. Six months is simply WAY too long for easy
upgrades, and going longer than that, you're needlessly losing out on
updates and features that make computing both easier and more pleasant.

Duncan

Canonical?

Posted Jul 21, 2009 3:34 UTC (Tue) by jmm82 (guest, #59425) [Link]

I have been using Ubuntu for two years, exclusively. Before Ubuntu I used Red Hat.

I work at a start-up with about 10 Linux users and all but one use Ubuntu(the other uses Suse). Everyone I know that uses it are happy and yes we do *real* Linux programming stuff.

Canonical? No.

Posted Jun 22, 2009 20:14 UTC (Mon) by asdlfiui788b (guest, #58839) [Link] (3 responses)

Canonical does not contribute upstream to Linux, at all. Whatsoever. They just take what they can scrap up from Debian, and make money on it.

Canonical? No.

Posted Jun 22, 2009 22:32 UTC (Mon) by mjg59 (subscriber, #23239) [Link]

It's entirely untrue that Canonical contribute nothing to upstream. It can certainly be argued that their contributions are small relative to their profile and developer count, but that's not the same thing.

Canonical? No.

Posted Jun 29, 2009 20:17 UTC (Mon) by job (guest, #670) [Link] (1 responses)

You mean they lose money on it. :)

I will save any judgement on Canonical or any other company until they have black figures. Until then, the process is not yet sustainable and matters little in the long run.

Canonical? No.

Posted Aug 12, 2009 19:22 UTC (Wed) by snadrus (guest, #60224) [Link]

Important point!
If [their organization cannot persist (which is true if they live in the red)] then
anyone requesting more from them could at-best see short-term benefits until they run out of cash reserves.

They've enhanced the open source desktop with Upstart & invited more independent developers to open source with Quickly and Launchpad.

scripts available?

Posted Jun 11, 2009 15:46 UTC (Thu) by khilman (subscriber, #37671) [Link] (1 responses)

Are the scripts/tools used to generate these stats publicly available?

scripts available?

Posted Jun 11, 2009 18:10 UTC (Thu) by jake (editor, #205) [Link]

> Are the scripts/tools used to generate these stats publicly available?

yup, the repository is here: git://git.lwn.net/gitdm.git

This message might be of interest as well: http://lkml.org/lkml/2009/1/14/452

jake


Copyright © 2009, Eklektix, Inc.
This article may be redistributed under the terms of the Creative Commons CC BY-SA 4.0 license
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds