Open Calais, by Thomson Reuters

Note: I am aware this page needs a re-write. As of 5/24/2016 the -dev branch no longer requires the ARC2 library. I am in need of testers for this as it is relying on the JSON feed and a lot of code had to be refactored accordingly. I will have an update of this page (read: overhaul) by June 1)

What is it?

The Calais Collection is an integration of the Thomson Reuters' Calais web service into the Drupal platform. The Calais Web Service automatically creates rich semantic metadata for the content you submit – in well under a second. Using natural language processing, machine learning and other methods, Calais analyzes your document and finds the entities within it. But, Calais goes well beyond classic entity identification and returns the facts and events hidden within your text as well. The web service is free for commercial and non-commercial use. It requires registration to obtain an API Key.

Read webchick's fantastic Introduction to Calais for Drupal.

What's New?

  • Working Drupal 6 version. Drupal 6 & 7 Updated for OpenCalais recent changes
  • Upgraded to work with Calais release 4.3
  • SocialTags integration
  • Full Calais data integration with Views
  • Full support for Calais disambiguated URIs and data for Geo (City/State/Country), Company, Products

API

This module provides a flexible API for modules to use when integrating with the Calais Web Service. There is a function based and an object oriented API.

Tagging Integration with Nodes and Taxonomy

This provides the capability to integrate Calais Entity, Event, and Fact metadata with Drupal Nodes. The Calais module lets you configure which Content Types should be analyzed by Calais for metadata extraction on update. The metadata returned can then be automatically assigned to vocabularies, or it can only suggest terms allowing full user control of the tagging (think of del.icio.us recommending tags). A flexible set of hooks allows 3rd party modules to make modifications before or after Calais metadata has been processed and applied.

Geomapping

The Calais Geo module allows for plotting various Calais Vocabulary terms on a map, as provided by the GMap module. It makes use of the Calais Web Service facilities that provide latitude & longitude for relevant geo terms. Map data is provided as a block and as part of the node properties.

Installation Notes

The ARC2 library is required for this module to function.

  • The D6 version also requires ARC2, however it should be installed as part of the RDF module, which is a dependency
  • The D6 Version 2.2+ requires the RDF module alpha5 release or later

The Taxonomy Manager module is not required, but can make your life much easier in the event that large amounts of unique terms are applied to your content.

Calais Collection

Also part of the collection is

Drupal 6 Maintained

If you are wanting Drupal 6 maintenance releases, I am working on those but at a different pace based on demand. Please be sure to drop me a note if you are interested in future D6 builds. They will be available on GitHub as there is no way to release them on Drupal.org any longer.

Refactor & Drupal 8 Version Coming

As part of Drupal 8 development, Drupal 7 and 8 will be a complete re-write of the OpenCalais/PERM ID integration with many more options. Follow @mbagnall17 on Twitter for development details and updates.

Credits

Updated to work with the new Open Calais by ElusiveMind at FlyingFlip Studios, LLC.

Project Information

Downloads