Modules that help transfer content and data into or out of the site, whether through migration, backup, or exposing data to external systems.

Scraper

For a modern, working example, see the Example Web Scraper (built on Feeds and Feeds XPath Parser).

@TODO: For Drupal 6, use SimpleXML as done by Nick Lewis.

The currently maintained portion of this module is a very simple scraper that takes a URL plus beginning and ending code and displays the content found between them in a block. (This could easily be extended to display on a page.)
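The "beginning and ending code" idea can be illustrated with a minimal sketch in Python. This is not the module's own code: the function name `extract_between` and the sample page are hypothetical, and fetching the URL is left out so the example stays self-contained.

```python
from typing import Optional


def extract_between(html: str, start: str, end: str) -> Optional[str]:
    """Return the text between the first `start` marker and the next `end` marker.

    Returns None when either marker is missing.
    """
    begin = html.find(start)
    if begin == -1:
        return None
    begin += len(start)  # skip past the start marker itself
    finish = html.find(end, begin)
    if finish == -1:
        return None
    return html[begin:finish]


# Hypothetical page source; a real scraper would download this from the URL.
page = '<html><body><div id="news">Latest headlines</div></body></html>'
snippet = extract_between(page, '<div id="news">', '</div>')
print(snippet)
```

Plain string markers are brittle when the target page changes its markup, which is one reason the XPath-based approach mentioned above may be preferable.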

It does not currently store the scraped data, so you need to use the blockcache module and replace the scraped_content block with its cached equivalent. Otherwise the page you are scraping is requested every time the scraped_content block is viewed.
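Why caching matters here can be shown with a small time-based cache, sketched in Python. This is only an illustration of the general technique, not how blockcache works internally; the class name `TTLCache` and its parameters are assumptions made for the example.

```python
import time
from typing import Callable, Dict, Tuple


class TTLCache:
    """Cache fetched content per URL and reuse it until it is older than the TTL."""

    def __init__(
        self,
        fetch: Callable[[str], str],
        ttl_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch          # hypothetical function that downloads a URL
        self._ttl = ttl_seconds
        self._clock = clock          # injectable so the cache can be tested
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, url: str) -> str:
        now = self._clock()
        entry = self._entries.get(url)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]          # still fresh: no remote request
        content = self._fetch(url)   # missing or stale: fetch again
        self._entries[url] = (now, content)
        return content
```

With a cache like this in front of the scraper, repeated block views within the TTL cost one remote request instead of one per view.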

4.7 only: This module also contains the Legacy scraper, code from an industrial-strength scraping module written for 4.7 that is no longer maintained.

See also Import HTML module.

RSSKey

The RSSKey module provides RSS feeds for selected content (e.g. story or forum) and taxonomy, authenticated by a cryptographic key that is included in the feed URL. This permits RSS aggregators to get useful feeds from a Drupal installation that does not allow anonymous users to view content, or that varies the content permitted by user.
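One common way to put a key in a feed URL is an HMAC over the user and feed path, sketched below in Python. The source does not say how RSSKey derives or checks its keys, so the scheme, the `uid` and `key` query parameters, and all names here are assumptions for illustration only.

```python
import hashlib
import hmac

# Hypothetical site-wide secret; a real site would generate and store its own.
SITE_SECRET = b"replace-with-a-long-random-secret"


def feed_key(user_id: int, feed_path: str, secret: bytes = SITE_SECRET) -> str:
    """Derive a per-user, per-feed key as a hex HMAC-SHA256 digest."""
    message = f"{user_id}:{feed_path}".encode("utf-8")
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def feed_url(base: str, user_id: int, feed_path: str) -> str:
    """Build a feed URL that carries the user id and the matching key."""
    return f"{base}/{feed_path}?uid={user_id}&key={feed_key(user_id, feed_path)}"


def is_valid(user_id: int, feed_path: str, key: str) -> bool:
    """Check a presented key using a constant-time comparison."""
    return hmac.compare_digest(feed_key(user_id, feed_path), key)
```

An aggregator that cannot log in can then request the URL from `feed_url`, and the site can decide which items to serve based on the user the key was issued for.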

Syndication2

Lets readers add your feed directly to MyYahoo, Google, and other popular feed readers.

Import / Export API

The import / export API allows for the definition of data entities within Drupal.

Google Data Integration

This module is currently being revamped as a 6.x contributed module.

Plans at the moment include:

Pages
