I wonder where I can found corpora with texts representing different varieties of a language (sociolinguistically seen, with different code/subcodes of a language). For example, a jargon, slang, dialect, thievish/larcenous corpus? The language is not very important :)
I know this mailing list is generally about corpora but maybe somebody can remember on some striking examples...
Best Alexander -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 491 bytes Desc: not available URL: <http://www.uib.no/mailman/public/corpora/attachments/20111102/ca51b3d8/attachment.txt>