<?xml version="1.0" encoding="UTF-8"?>











<rss version="2.0" xmlns:jf="http://www.jivesoftware.com/xmlns/jiveforums/rss">



<channel>
    <title>Support Forums: Message List - Language Identification System: How to recognize other languages than English</title>
    <link>http://www.theserverside.com</link>
    <description>Most recent forum messages</description>
    <language>en</language>
    
        <generator>Jive Forums Silver 5.5.30 (www.jivesoftware.com)</generator>
    
    <pubDate>Thu, 23 May 2013 07:22:26 -0400</pubDate>


    <item>

        <title>Language Identification System: How to recognize other languages than English</title>
        <link>http://www.theserverside.com/discussions/thread.tss?thread_id=62570</link>

        

        
            <description><![CDATA[<p>Analyzing short texts will always be a major headache because they contain so little information. For the Twitter case, it may be a good idea to combine multiple posts from the same user, which gives you more words to work with. And one user probably uses...]]></description>
        

        <pubDate>Fri, 01 Mar 2013 08:01:42 -0500</pubDate>

        

        <jf:creationDate>Fri, 01 Mar 2013 08:01:42 -0500</jf:creationDate>
        <jf:modificationDate>Fri, 01 Mar 2013 08:01:42 -0500</jf:modificationDate>
        <jf:date>Mar 1, 2013</jf:date>
        <jf:author>Nick Snels</jf:author>
        <jf:replyCount>0</jf:replyCount>
    </item>


    <item>

        <title>Re: Language Identification System: How to recognize other languages than English</title>
        <link>http://www.theserverside.com/discussions/thread.tss?thread_id=62570</link>

        

        
            <description><![CDATA[<p>"Are you using Levenshtein's distance?"</p>
<p>I do, sometimes, depending on the n-gram analysis :)</p>]]></description>
        

        <pubDate>Wed, 29 Jun 2011 12:48:20 -0400</pubDate>

        

        <jf:creationDate>Wed, 29 Jun 2011 12:48:20 -0400</jf:creationDate>
        <jf:modificationDate>Wed, 29 Jun 2011 12:48:20 -0400</jf:modificationDate>
        <jf:date>Jun 29, 2011</jf:date>
        <jf:author>Istvan Soos</jf:author>
        <jf:replyCount>0</jf:replyCount>
    </item>


    <item>

        <title>Re: Language Identification System: How to recognize other languages than English</title>
        <link>http://www.theserverside.com/discussions/thread.tss?thread_id=62570</link>

        

        
            <description><![CDATA[<p>Hi Istvan,</p>...]]></description>
        

        <pubDate>Wed, 29 Jun 2011 11:23:50 -0400</pubDate>

        

        <jf:creationDate>Wed, 29 Jun 2011 11:23:50 -0400</jf:creationDate>
        <jf:modificationDate>Wed, 29 Jun 2011 11:23:50 -0400</jf:modificationDate>
        <jf:date>Jun 29, 2011</jf:date>
        <jf:author>israel olalla</jf:author>
        <jf:replyCount>1</jf:replyCount>
    </item>


    <item>

        <title>Language Identification System: How to recognize other languages than English</title>
        <link>http://www.theserverside.com/discussions/thread.tss?thread_id=62570</link>

        

        
            <description><![CDATA[<p>Snowball might be good for some languages, and stopwords for a different (slightly overlapping) set of languages, but for languages in general you need to find another solution. For example, variable-sized n-grams in combination with a text-distance algorithm...]]></description>
        

        <pubDate>Wed, 29 Jun 2011 05:49:52 -0400</pubDate>

        

        <jf:creationDate>Wed, 29 Jun 2011 05:49:52 -0400</jf:creationDate>
        <jf:modificationDate>Wed, 29 Jun 2011 05:49:52 -0400</jf:modificationDate>
        <jf:date>Jun 29, 2011</jf:date>
        <jf:author>Istvan Soos</jf:author>
        <jf:replyCount>2</jf:replyCount>
    </item>


    <item>

        <title>Language Identification System: How to recognize other languages than English</title>
        <link>http://www.theserverside.com/discussions/thread.tss?thread_id=62570</link>

        

        
            <description><![CDATA[<p>Language identification systems, from n-gram-based solutions to dictionary-based ones, usually fail when analyzing short sentences, but we've devised a new technique based on stemming...]]></description>
        

        <pubDate>Tue, 28 Jun 2011 07:56:03 -0400</pubDate>

        

        <jf:creationDate>Tue, 28 Jun 2011 07:56:03 -0400</jf:creationDate>
        <jf:modificationDate>Tue, 28 Jun 2011 07:56:03 -0400</jf:modificationDate>
        <jf:date>Jun 28, 2011</jf:date>
        <jf:author>israel olalla</jf:author>
        <jf:replyCount>4</jf:replyCount>
    </item>



</channel>
</rss>

