Herrmann, J.B. wrote:
> [...] I am also interested in using the web as a corpus - does
> anybody know of a way of filtering for youth language?
I would use a forum/newsgroup/chat system for young people. And if "web" does not need to be www: you could e.g. log an IRC chatroom. This is technically easy because there are command line tools available so that you could pipe the traffic into a file.
If you are not used to IRC, I remember the IRCNet to be a network with a lot of well visited channels. A starting point could be http://irc.netsplit.de/channels/?net=IRCnet
But: IRC is not representative for normal use of language ;-).