I'm trying to import this full catalog (official news subject tagging system):
http://nets.ii.uam.es/neptuno/iptc/iptc-srs.rdfs
On import, it recognizes 8502 data triples, but only 4 resources to be imported...
How do I map the correct structure to taxonomies?
The IPTC RDF is structured as Subject -> Matter -> Detail (a three-level taxonomy hierarchy).
<rdf:RDF>
<iptc:Subject rdf:about="http://nets.ii.uam.es/iptc#01000000" iptc:changeComment="none" iptc:changeVersion="0" iptc:title="arts, culture and entertainment" rdfs:label="arts, culture and entertainment">
<iptc:SubjectMatter rdf:about="http://nets.ii.uam.es/iptc#01010000" iptc:changeComment="none" iptc:changeVersion="0" iptc:explanation="NA" iptc:title="literature" rdfs:label="literature">
<rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01000000"/>
</iptc:SubjectMatter>
<iptc:SubjectDetail rdf:about="http://nets.ii.uam.es/iptc#01010001" iptc:changeComment="none" iptc:changeVersion="9" iptc:title="fiction" rdfs:label="fiction">
<rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01010000"/>
</iptc:SubjectDetail>
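For readers who want to check the hierarchy outside Drupal, here is a minimal Python sketch (not part of taxonomy_xml) that reads the `rdfs:label` of each resource and follows `rdfs:subClassOf` up to the top-level Subject. The `iptc` namespace URI, the trimmed-down `SAMPLE` document, and the helper names `parse_terms` and `path_to_root` are assumptions made for illustration; the excerpt above does not show the namespace declarations or where `iptc:Subject` closes.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
IPTC = "http://nets.ii.uam.es/iptc#"  # assumed binding for the iptc: prefix

# Trimmed copy of the excerpt; attributes not needed here are left out,
# and iptc:Subject is self-closed (an assumption about the real file).
SAMPLE = f"""<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}" xmlns:iptc="{IPTC}">
  <iptc:Subject rdf:about="http://nets.ii.uam.es/iptc#01000000"
                rdfs:label="arts, culture and entertainment"/>
  <iptc:SubjectMatter rdf:about="http://nets.ii.uam.es/iptc#01010000"
                      rdfs:label="literature">
    <rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01000000"/>
  </iptc:SubjectMatter>
  <iptc:SubjectDetail rdf:about="http://nets.ii.uam.es/iptc#01010001"
                      rdfs:label="fiction">
    <rdfs:subClassOf rdf:resource="http://nets.ii.uam.es/iptc#01010000"/>
  </iptc:SubjectDetail>
</rdf:RDF>"""


def parse_terms(xml_text):
    """Return {uri: {"label": ..., "parent": ...}} for every described resource."""
    root = ET.fromstring(xml_text)
    terms = {}
    # iter() walks all descendants, so nested and flat layouts both work.
    for node in root.iter():
        uri = node.get(f"{{{RDF}}}about")
        if uri is None:
            continue  # not a resource description (e.g. rdfs:subClassOf itself)
        parent_el = node.find(f"{{{RDFS}}}subClassOf")
        parent = parent_el.get(f"{{{RDF}}}resource") if parent_el is not None else None
        terms[uri] = {"label": node.get(f"{{{RDFS}}}label"), "parent": parent}
    return terms


def path_to_root(terms, uri):
    """Labels from the top-level term down to `uri`, following parent links."""
    labels, seen = [], set()
    while uri in terms and uri not in seen:  # `seen` guards against cycles
        seen.add(uri)
        labels.append(terms[uri]["label"])
        uri = terms[uri]["parent"]
    return labels[::-1]


if __name__ == "__main__":
    terms = parse_terms(SAMPLE)
    print(" -> ".join(path_to_root(terms, "http://nets.ii.uam.es/iptc#01010001")))
```

Only `rdfs:label` is used as the term name here; the `iptc:*` attributes are ignored, which is the behaviour the import is expected to have.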
Comment | File | Size | Author |
---|---|---|---|
#5 | IPTC-remote_import.png | 41.05 KB | dman |
#5 | IPTC-terms.png | 31.71 KB | dman |
Comments
Comment #1
Anonymous (not verified) commented:
By the way, the four terms stored are changeComment, changeVersion, explanation and title. It should only take the rdfs:label content, like "arts" and "literature".
Comment #2
Anonymous (not verified) commented:
Saw it was added as a sample...
Comment #3
tekken commented:
I have a problem similar to the OP's. The IPTC website has several newscode vocabularies. I tried to import the following:
http://cv.iptc.org/Requester?scheme=subjectcode&format=rdf
Whatever option I tried, the import failed. Is the format used by IPTC completely incompatible or could these codes be imported after some tweaking of the module code?
Comment #4
tekken commented
Comment #5
dman commented:
:-/
The taxonomy_xml RDF SKOS import rules use the canonical URL http://www.w3.org/2004/02/skos/core#Concept for terms, as documented in the reference doc.
The IPTC RDF you link to instead points at the reference document itself to identify the concept that it is describing. This is not correct, so somebody is confused. http://www.w3.org/TR/skos-reference/skos.html#Concept (which the RDF sample uses) is documentation about the thing. http://www.w3.org/2004/02/skos/core#Concept (which I use) is the identifier for the thing.
But yeah, there is lots of bad data out there in the wild. At least IPTC is finally trying to do RDF - it wasn't even doing that when I first imported them.
As yet another compromise, I can add their incorrect version as a term type.
Support for this is now added to 6.x-2.x-dev.
Now works as expected for me!
FYI, the way to add new notations for concepts is in rdf_format.inc, which lists the collection of synonyms for 'term' encountered so far.
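As a rough illustration of the synonym idea only: the following Python sketch is not the module's code (rdf_format.inc is PHP and its contents are not shown in this thread). The names `TERM_TYPE_SYNONYMS` and `is_term` are hypothetical. It treats both the canonical SKOS identifier and the documentation URL used by the IPTC file as meaning "this resource is a term".

```python
# Canonical identifier for a SKOS concept.
SKOS_CONCEPT = "http://www.w3.org/2004/02/skos/core#Concept"
# Documentation URL that the IPTC file uses in place of the identifier.
SKOS_CONCEPT_DOC = "http://www.w3.org/TR/skos-reference/skos.html#Concept"

# Hypothetical synonym list: every type URI that should be imported as a term.
TERM_TYPE_SYNONYMS = {SKOS_CONCEPT, SKOS_CONCEPT_DOC}


def is_term(rdf_type_uri):
    """True when the resource's rdf:type is one of the known 'term' notations."""
    return rdf_type_uri.strip() in TERM_TYPE_SYNONYMS


if __name__ == "__main__":
    for uri in (SKOS_CONCEPT, SKOS_CONCEPT_DOC, "http://example.org/other#Type"):
        print(uri, is_term(uri))
```

Supporting another non-standard notation then means adding one more URI to the set rather than changing the import logic.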
Comment #6
tekken commented:
Works like a charm. Thanks!