This project is not covered by Drupal’s security advisory policy.

Migrate Process XML is a handy module that reads XML and allows you to extract particular key sections using XPath.

See the README for more details.

Usage

This module provides the following migration process plugins:

  • xhtml: Attempts to convert permissive HTML into strict XHTML. Useful as a first step in a process pipeline.
  • xpath: Which allows you to run an xpath query against a field value, returning an XML string.
  • xvalue: Same thing as xpath, but returns only the body value of the query rather than XML.

Use like any other process plugin:

  field_html:
    source: html
    plugin: xhtml
  field_title:
    source: '@field_html'
    plugin: xvalue
    xpath: '//*[local-name()="TITLE"]'
  field_subtitle:
    source: '@field_html'
    plugin: xpath
    xpath: '//*[local-name()="h1"]'

This Module is Part of a Migration Framework

This module is part of a migration framework created for a client project in which we migrated information from Digital Measures to a Drupal site. However, the specific problems each module solves are common to migration processes in general. With the exception of Digitalmeasures Migrate, these modules are fairly generic and can be used for a variety of other migrations.

For more information on how this framework was used for a Digital Measure Migration, read:

Here are the modules in the migration framework:

  • Digitalmeasures Migrate
    Provides a method to access Digital Measures API through Drupal.
  • Migrate Process XML
    Reads XML and allows you to extract particular key sections using XPath.
  • Migrate Process S3
    Allows you to download objects from an S3 bucket as files to your Drupal site.
  • Migrate Process URL
    Allows you to manipulate URL values that are provided within the data.
  • Migrate Process Regex
    Provides a way to use Regular Expressions in a Drupal migration.
  • Migrate Process Vardump
    Often used for debugging, this module takes any data given to it and dumps it to the terminal output and then passes it on.
  • Migrate Process Skip
    Provides a few different mechanisms to define what is “empty” and should be skipped.
  • Migrate Process Trim
    This module provides a quick and simple means to remove leading or trailing characters (such as spaces) in a Drupal migration.
  • Migrate Process Array
    Provides utilities for interacting with arrays during migration.
Supporting organizations: 
Development time

Project information

Releases