I'm trying to import this full catalog (official news subject tagging system):
http://nets.ii.uam.es/neptuno/iptc/iptc-srs.rdfs

On import, it recognizes 8502 data triples, but only 4 resources to be imported...

How to map the correct structure to taxonomies?

The IPTC RDF is structued like Subject -> Matter -> Detail (3 level taxonomy hierarchy).

<rdf:RDF>
+
<iptc:Subject rdf:about="http://nets.ii.uam.es/iptc#01000000" iptc:changeComment="none" iptc:changeVersion="0" iptc:title="arts, culture and entertainment" rdfs:label="arts, culture and entertainment">
+
<iptc:SubjectMatter rdf:about="http://nets.ii.uam.es/iptc#01010000" iptc:changeComment="none" iptc:changeVersion="0" iptc:explanation="NA" iptc:title="literature" rdfs:label="literature">
<rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01000000"/>
</iptc:SubjectMatter>
+
<iptc:SubjectDetail rdf:about="http://nets.ii.uam.es/iptc#01010001" iptc:changeComment="none" iptc:changeVersion="9" iptc:title="fiction" rdfs:label="fiction">
<rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01010000"/>
</iptc:SubjectDetail>
CommentFileSizeAuthor
#5 IPTC-remote_import.png41.05 KBdman
#5 IPTC-terms.png31.71 KBdman
Support from Acquia helps fund testing for Drupal Acquia logo

Comments

Anonymous’s picture

By the way, the four terms stored are: changeCommet, changeVersion, explanation and title.... It should only get the rdfs:label content, like "arts" and "literature".

Anonymous’s picture

Status: Active » Closed (fixed)

Saw it was added as a sample...

tekken’s picture

Title: Help importing IPTC SRS (subject reference system) » Help importing IPTC NewsCodes

The problem I have is similar to the OP. The IPTC website has several newscode vocabularies. I tried to import the following:
http://cv.iptc.org/Requester?scheme=subjectcode&format=rdf

Whatever option I tried, the import failed. Is the format used by IPTC completely incompatible or could these codes be imported after some tweaking of the module code?

tekken’s picture

Status: Closed (fixed) » Active
dman’s picture

FileSize
31.71 KB
41.05 KB

:-/

The taxonomy_xml rdf SKOS import rules use the cannonic URL http://www.w3.org/2004/02/skos/core#Concept for terms as documented in the reference doc.
That IPTC RDF you link to uses a reference to the reference document itself to identify the concept this it is describing. This is not correct, so somebody is confused. http://www.w3.org/TR/skos-reference/skos.html#Concept (which the RDF sample uses) is documentation about the thing. http://www.w3.org/2004/02/skos/core#Concept (Which I use) is the identifier for the thing.

But yeah, there is lots of bad data out there in the wild. At least IPTC is finally trying to do RDF - it wasn't even doing that when I first imported them.
As yet another compromise, I can add their incorrect version as a term type.

Support for this is now added to 6.x-2.x-dev.
Now works as expected for me!

FYI, the way to add new notations for concepts is in rdf_format.inc where a collection of synonyms for 'term' encountered so far is listed.

tekken’s picture

Works like a charm. Thanks!