Language identification systems usually fail when we are analyzing short sentences, from the solutions based in ngram to solutions based in dictionaries, usually fail when analyzing short Sentences, but we've arranged a new technique based in stemming (porter) + stopwords, that does the job and works pretty fast.
In this article we share with you the results of this analysis and the times we get compared with google apis.
What's your opinion? Can Stemming + stopwords complete n-grams to solve the language analysis tools for short phrases or shall we forget about Twitter and just don't analyze those damn short messages.
Kind Regards.