Last week, we learnt about Lemmatization. (Post: What Is Lemmatization?) Today, we’ll be focusing on the actual application of the Lemmatizer API.
English is one of the most widely used language in the world, having over 335 millions native speakers. (Source: Ethnologue, 2014) Taking into consideration this number only represents people speaking English as a first language, the actual number of English speakers (including people adopting English as a second/third language) far exceeds the figure mentioned above.
Depending on where you are from, the English might be a little different. Accent and slang aside, there are times when a word can have two different spelling; the British English or the American English way of spelling. Listing a few examples:
How can we program the computer to recognize the same word with different spellings? The solution is the integration of the Lemmatizer API. Not just simply returning words to their root form, the Lemmatizer API also recognizes “Colour” = “Color”, “Organise” = “Organize”.
PS: Did you know that “is”, “was” and “were” are inflected forms of “be”?
2 Comments
Can I get this Lemmatizer API freely…. how can i get this and how can i apply this API to my own program. Please help me regarding this…( i just tried Lemmatization with R but i didn’t got desiedred output
Hi @BHANUMATHIHM, you can try out the Lemmatizer API on the demo page. There you can see how it works and an explanation of how each option controls the output. You can also check out this post about controlling the output.
Our APIs are listed on the Mashape API Marketplace and are RESTful APIs. This Lemmatizer API is free so all you have to do is just sign up on Mashape to get your free API key and the endpoint info. Please feel free to comment again if you have any questions. You can also email us at [email protected] if you need more specific help.
Good luck!