[Corpora-List] Source code corpora

Klaus Guenther klaus.guenther at split.uni-bamberg.de
Thu Nov 20 20:43:34 CET 2008

It may be interesting to note that some projects have an automated coding standard enforcement. Due to the wide range of compliance in the aforementioned PEAR repository, a tool was developed to automatically parse code to ensure compliance with certain standards. The package is called PHP_CodeSniffer [1] and apparently works just fine. There is also a proposal page with comments [2].

Regards, Klaus

[1] http://pear.php.net/package/PHP_CodeSniffer [2] http://pear.php.net/pepr/pepr-proposal-show.php?id=426

Alexandre Rafalovitch schrieb:
> On Thu, Nov 20, 2008 at 2:21 PM, Klaus Guenther
> <klaus.guenther at split.uni-bamberg.de> wrote:
>> So the main issue is finding code that can reliably be attributed to an
>> author in an unmodified form and discovering details that are not
>> attributable to the project's coding standard. I know of no such corpus.
> This sounds like an interesting pre-condition research project then,
> as an inversion of 'keeping to the coding standards'.
> Take a set of source code repositories and determine whether all
> contribution are bellow or above the threshold of similarity.
> Something with self-organisation, perhaps, and then comparing number
> of clusters with number of actual developers.
> Personal blog: http://blog.outerthoughts.com/
> Research group: http://www.clt.mq.edu.au/Research/
> Hmm.
> Regards,
> Alex.
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

More information about the Corpora mailing list