[Corpora-List] converting non-embedded tags into embedded ones

Lou Burnard lou.burnard at computing-services.oxford.ac.uk
Sun Feb 24 13:32:26 CET 2008


If your input is well-formed XML (which it appears to be, except that the attribute value NN needs quotation marks around it), then the easiest way would be to use an XSLT script containing a template like this:

<xsl:template match="lex">
  <xsl:value-of select="."/>_<xsl:value-of select="@pos"/>
</xsl:template>

You can process such a script with (for example) xsltproc, which is available for both Windows and Unix.
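
As a rough sketch, a complete stylesheet wrapping the template above might look like the following. The file names (tag.xsl, input.xml, output.txt) are made up for illustration, and I am assuming that plain text output is what is wanted and that the input has a single root element around the text.

```xml
<?xml version="1.0"?>
<!-- tag.xsl (hypothetical file name): rewrite <lex pos="NN">time</lex> as time_NN -->
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Assumption: plain text output is wanted, so no XML declaration is emitted -->
  <xsl:output method="text"/>

  <!-- Emit the word, an underscore, then the value of its pos attribute -->
  <xsl:template match="lex">
    <xsl:value-of select="."/>_<xsl:value-of select="@pos"/>
  </xsl:template>

  <!-- Text outside <lex> is copied through unchanged by XSLT's built-in rules -->
</xsl:stylesheet>
```

With the stylesheet saved as tag.xsl, running "xsltproc tag.xsl input.xml > output.txt" should write the converted text to output.txt. For an input such as <text><lex pos="NN">time</lex> <lex pos="VBZ">flies</lex></text>, I would expect the result to be time_NN flies_VBZ.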

Warren Tang wrote:
> Could someone help me with this problem:
>
> I have texts with non-embedded tags:
>
> eg: <lex pos=NN>time</lex>
>
> but I would like to convert them to embedded tags (if this is the
> right term):
>
> e


