Sunday, June 25, 2006
CVSAnalY_SF update - project unixname
Today we got notice from the CVSAnalY folks that their data now maps to ours via unix projectname. (CVSAnalY_SF is a project that mines data from CVS repositories, such as those for projects hosted on Sourceforge.)
He writes:
So, if you've been using our FLOSSmole data, you know that each Sourceforge project has a unique project "unixname". Well, you can now grab our data, and grab the CVSAnalY_SF data, and use them together to create even more interesting data sets.
He writes:
You can find data, schema, explanation and known bugs at:
http://libresoft.urjc.es/Data/CVSAnalY_SF
this data set may be more interesting to you as it includes now the
project table which allows to know the unix_name of the project (so that
you can link data from here with FLOSSMole, among others).
So, if you've been using our FLOSSmole data, you know that each Sourceforge project has a unique project "unixname". Well, you can now grab our data, and grab the CVSAnalY_SF data, and use them together to create even more interesting data sets.
Sunday, June 04, 2006
June 2006 Sourceforge data released
This has to be a record! It's only the 4th of June and already the files have been posted. Hooray for 28 machines working on the alphabet! Hooray for not procrastinating!
Pick up the files from our FLOSSmole file release page on Sourceforge.
Here's what's included:
Package: sfProjectInfo
Release: sfProjectInfo01-Jun-2006
Package: sfRawDeveloperData
Release: sfRawDeveloperData01-Jun-2006
Package: sfRawData
Release: sfRawData01-Jun-2006
If you're going to use the Query Tool instead of the raw data files, please read these handy tips first. And as always, the "how-to" for using our data is available too.
Pick up the files from our FLOSSmole file release page on Sourceforge.
Here's what's included:
Package: sfProjectInfo
Release: sfProjectInfo01-Jun-2006
Files:
--ProjectList01-Jun-2006.csv.bz2: list of just project names
--ProjectInfo01-Jun-2006.csv.bz2: list of all basic project info (i.e. number of developers, registration dates, etc)
--ProjectDescriptions01-Jun-2006.csv.bz2: project names and their text descriptions (this file is quite large)
Package: sfRawDeveloperData
Release: sfRawDeveloperData01-Jun-2006
Files:
--sfRawDevelopers01-Jun-2006.csv.bz2: list of all developers
--sfRawDevProjects01-Jun-2006.csv.bz2: list of which projects are worked on by which developers
Package: sfRawData
Release: sfRawData01-Jun-2006
Files:
--sfRawDbEnvData01-Jun-2006.csv.bz2: list of projects and their database environments
--sfRawDonorData01-Jun-2006.csv.bz2: list of projects and their donors
--sfRawIntAudData01-Jun-2006.csv.bz2: list of projects and their intended audiences
--sfRawLicenseData01-Jun-2006.csv.bz2: list of projects and their open source licenses
--sfRawOpSysData01-Jun-2006.csv.bz2: list of projects and their operating systems
--sfRawProgLangData01-Jun-2006.csv.bz2: list of projects and their programming languages
--sfRawStatusData01-Jun-2006.csv.bz2: list of projects and status
--sfRawTopicData01-Jun-2006.csv.bz2: list of projects and their topics
--sfRawUserIntData01-Jun-2006.csv.bz2: list of projects and their user interfaces
If you're going to use the Query Tool instead of the raw data files, please read these handy tips first. And as always, the "how-to" for using our data is available too.