This is the personal website of John Watson: father, software developer, artist, guitar player. Follow me on Mastodon or Twitter or Twitch or itch.io or GitHub.

Big Huge Thesaurus API

I’ve just rolled out an API for the Big Huge Thesaurus. Need synonyms? Give the free API a try.

Comments

  1. dmitry sherman on 2008-02-29 10:14:42 wrote: Hey John, I’ve just tested the API or actually the dictionary itself; it has some lacks… it doesn’t strip the present progressive suffix ‘ing’ for some of the words(for instance ‘hosting’) and it doesn’t strip the past tense suffix ’ed’(i.e ‘played’) (it’s only what i checked). Sometimes it provides results and I guess it happends because of manually added word forms. (‘working’). For comparison the Oxford version in Babylon which which do the job. I thinkabout some client side processing for stripping, but what about the words that have special meaning in tense inflection ? Or what about the words that were already added?

  2. John on 2008-02-29 11:39:16 wrote: Good points. Thanks for letting me know your thoughts. I’ll probably add some of your suggestions especially re: tenses. I’m not sure about the ings though. Ideally each of those unique versions would be added to the dictionary. That is, hosting should be in the dictionary rather than just returning the listing for host.

  3. Avinash on 2008-03-09 21:53:33 wrote: *I would like to make a suggestion: it would be really cool for the XML version to group the results into nouns, verbs etc and returns that to the user. eg. data1 data2 data3 data4 *

  4. John on 2008-03-10 07:05:18 wrote: I’ve got that data. The reason it’s like this is that I was starting with the plain text output and didn’t want to complicate it and I wanted all the various output formats to match. For the XML output, I could add p=“noun” as an attribute for each word element. But for the plain text output? word,partofspeech? I don’t know. Then again, maybe the different formats don’t have to match…

  5. Rob on 2008-03-26 08:44:12 wrote: Very cool. It would be useful to me (read: willing to pay for access) if it did the following: 1) Separated by type (noun, verb, adverb, etc.) 2) Separated by tense (past, future, -ing, etc.) 3) Prioritized by closest match to original keyword In other words, if I enter the word ‘recognize’, it returns a list like: accept accost accredit acknowledge address agnise agnize appreciate be call back call up certify cognise … It would be great if the ones at the top of the list were things like: acknowledge accredit appreciate Not sure how possible that would be, but I have my wallet out!

  6. John on 2008-04-02 17:59:07 wrote: For those of you following along, the API now: - searches harder to find a match by dropping common endings (ing, ed, s) when it can’t find an exact match. - has better error codes in the form of HTTP response codes - has a version 2 of the API with part of speech data - sorts results by “most common” words first And you can now purchase upgraded service directly from the site. See the API key request page for pricing.