[Corpora-List] Source code corpora

Eric Atwell eric at comp.leeds.ac.uk
Thu Nov 20 11:11:37 CET 2008


I also seek source code corpora, but with English-language requirements specifications accompanying each program; for a PhD project on mapping from English specification to formalism and/or code. Any pointers welcome.

Eric Atwell, School of Computing, University of Leeds

On Thu, 20 Nov 2008, sdb at cs.rmit.edu.au wrote:


> Dear colleages,
>
> My research relates to authorship attribution of source code (that is,
> determining the owner of anonymous work samples based upon other work
> samples where authors are known).
>
> I'm looking for recommendations for source code corpora for this task
> for any programming language. For the corpora to be useful, authorship
> has to be identified.
>
> My work to date has involved student programming assignments, and I'm
> now interested in other sources such as industry and open-source
> projects.
>
> Many thanks,
>
> ---------------------------------------------------
> Steven Burrows
> PhD Candidate, Sessional Lecturer
> School of Computer Science & Information Technology
> RMIT University
> GPO Box 2476V, Melbourne VIC 3001, Australia
> o: 14.09.04
> p: +(61 3) 9925 2758
> f: +(61 3) 9662 1617
> e: steven.burrows at rmit.edu.au
> w: www.cs.rmit.edu.au/~sdb
>



More information about the Corpora mailing list