Monday, April 25, 2005

 

April 2005 Summary Reports

The April 2005 Summary Reports have been posted:

Old reports:

 

How to use this data

(Note: This message is updated periodically with new info.)

The FLOSSmole project provides data about:

(a) all projects on Sourceforge
(b) all developers on Sourceforge
(c) all projects on Sourceforge AND who is developing for them, their roles, whether they are an administrator, etc.
(d) all Sourceforge projects and their programming languages, operating systems, user interfaces, end user audience, registration dates, etc (new: donations!)
(e) Edit, Oct-2005: much of the above, but for Freshmeat, also
(f) Edit, Jul-2006: also, Rubyforge
(g) Edit, Jul-2006: also, Objectweb
(h) Edit, Jan-2007: also, Free Software Foundation directory
(i) Edit, Feb-2007: also, SourceKibitzer donates data

We have done runs on Sourceforge starting in early 2004 and we have received donated Sourceforge data for December 2004 from Dawid Weiss in Poland.

We began also scraping Freshmeat, Rubyforge, and Objectweb, and we receive data from SourceKibitzer. Get the complete list of data sources here. (This is a list of each of our scrapes and the date and it's "datasource" ID.) The abbreviations for the forges are RF (Rubyforge), SF (Sourceforge), FM (Freshmeat), OW (Objectweb), FSF (Free Software Fndn Directory), SK (SourceKibitzer).

We are now collecting information from Sourceforge every 60 days, and from Freshmeat/Rubyforge/ObjectWeb/FSF/SK every 30 days (monthly).

You can get all the raw data files from the project file release system.

In addition to the text files (database dumps), we have a basic query tool. Details and tips for using the query tool are available here.

Hope this helps, and please contact me at any time (mconklin AT elon DOT edu) to discuss the data or what is missing, what you'd like to do with it, etc.

Sunday, April 24, 2005

 

April 2005 Raw Data Released

I've released the raw data files for April 2005 Sourceforge scrape.
  1. Get the Raw List of Projects (full list of SF projects, registration dates, etc)
  2. Get the Raw Project Data (includes operating systems, programming languages, etc)
  3. Get the Raw Developer Data (includes developer list and developer-projects list, with new administrative flag!)

This is good stuff! Summary reports coming soon.

Saturday, April 23, 2005

 

data donations

Thanks to all who have donated and used FLOSSmole data. Here is a short explanation of who has collected data from us so far:


Datasource_IDDonation Notes
1Sourceforge full project data collected October 2004 by Megan Conklin.
2Sourceforge full project data collected December 2004 by Dawid Weiss, donated and imported March 2005.
3Sourceforge full project data collected January 2005 by Megan Conklin.
4Sourceforge full project data collected April 2005 by Megan Conklin.
5Sourceforge full project data collected July 2005 by Megan Conklin.
6Sourceforge project data collected 2001-02-03 by Kevin Crowston, parsed and loaded by James Howison July 2005.
7Sourceforge project data collected 2002-05-02 by Kevin Crowston, parsed and loaded by James Howison July 2005.
8October 2005 SF run
9Freshmeat, June 2005
10Freshmeat, June 2005
11Freshmeat, October 2005
12 Freshmeat, November 2005
13Sourceforge, December 2005
14Freshmeat, December 2005
15Freshmeat, January 2006
16 Sourceforge, February 2006
17Freshmeat, February 2006
18Freshmeat, March 2006
19Sourceforge, April 2006
20 test Rubyforge run
21Freshmeat, May 2006
22Sourceforge, June 2006
23 Freshmeat, June 2006
24Rubyforge, July 2006
25Freshmeat, April 2006
26 Freshmeat, July 2006
27ObjectWeb, August 2006
28Sourceforge, August 2006
29Freshmeat, August 2006
30Rubyforge, August 2006
31Rubyforge, September 2006
32Objectweb, September 2006
33Freshmeat, September 2006
34Sourceforge, October 2006
35Rubyforge, October 2006
36Objectweb, October 2006
37Freshmeat, October 2006
38Sourceforge, December 2006
39Rubyforge, December 2006
40Objectweb, December 2006
41Freshmeat, December 2006
42Freshmeat, January 2007
43Rubyforge, January 2007
44Objectweb, January 2007
45Free Software Foundation, January 2007
46Sourceforge, February 2007
47Freshmeat, February 2007
48Rubyforge, February 2007
49Objectweb, February 2007
50Free Software Foundation, February 2007
51SourceKibitzer, February 2007
52Freshmeat, March 2007
53Rubyforge, March 2007
54ObjectWeb, March 2007
55Free Software Foundation, March 2007
56SourceKibitzer, March 2007
57SourceForge, April 2007
58Freshmeat, April 2007
59Rubyforge, April 2007
60ObjectWeb, April 2007
61Free Software Foundation, April 2007
62SourceKibitzer, April 2007

Saturday, April 09, 2005

 

Sourceforge Bug Tracker data and analysis scripts

Just wanted to put in a pointer to the data and scripts that we used for our recent First Monday paper, The social structure of Free and Open Source software development. This data is part of OSSmole and Megan and I are working away currently merging out databases. But it is available now on the Syracuse FLOSS research site if people want to jump in.

This page is powered by Blogger. Isn't yours?