When importing nodes whose text fields contain the German letters Ä, Ö, Ü or ß, the imported node contains only the text that precedes the first such letter. The import of the rest of the text field stops there.
Comments
Comment #1
Anonymous (not verified) commented:
What type of file? Can you please attach a sample?
Comment #2
svenhs commented:
It happens, for example, when only stories are exported via XML. I don't know whether other content types are affected, but I'd guess so. It may be a problem with the character encoding. All field mappings are left at their defaults. I've attached the file.
Comment #3
Anonymous (not verified) commented:
If you change
<?xml version="1.0"?>
in the file to
<?xml version="1.0" encoding="UTF-8"?>
before importing, does it help? It appears the lack of the encoding attribute is the issue.
Comment #4
Anonymous (not verified) commented:
Same issue here.
I added encoding="UTF-8", but for me all data import still stops after the first umlaut.
I tried it with users and roles.
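One thing that might be worth ruling out here: adding encoding="UTF-8" to the declaration only helps if the bytes in the file really are UTF-8. Below is a minimal Python sketch, run outside Drupal, that re-encodes an export file to UTF-8 and rewrites the declaration to match. The function name `to_utf8_xml` and the Windows-1252 fallback are assumptions for illustration only; nothing in this thread confirms what encoding the export actually uses.

```python
import re


def to_utf8_xml(data: bytes, fallback: str = "cp1252") -> bytes:
    """Return the XML as UTF-8 bytes with a matching encoding declaration.

    Hypothetical helper, not part of the import/export module.
    """
    try:
        # "utf-8-sig" also strips a leading byte order mark, if present.
        text = data.decode("utf-8-sig")
    except UnicodeDecodeError:
        # Assumption: a file that is not valid UTF-8 is Windows-1252.
        text = data.decode(fallback)
    # Drop any existing declaration so that only the new one remains.
    body = re.sub(r"^<\?xml[^>]*\?>\s*", "", text)
    declaration = '<?xml version="1.0" encoding="UTF-8"?>\n'
    return (declaration + body).encode("utf-8")
```

If the import still stops at the first umlaut after converting the file this way, the file's encoding is probably not the cause and the problem is more likely in how the importer handles the parsed text.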
Comment #5
svenhs commented:
Changing the declaration to <?xml version="1.0" encoding="UTF-8"?> doesn't help. However, I realised that my result isn't really representative. I think testing this with two identical installations, the second one brand new, might give a completely different result. I tried to give both of my installations the same settings, modules, etc., but it didn't help.
I've attached my error messages; maybe they help. I'm using the German version of Drupal from drupalcenter.de. Despite these error messages, the import of the nodes works correctly except for the ä, ö, ü and ß mentioned before. The duplicate entries don't seem to stop the import either. But you will probably be able to read more out of the errors.
Comment #6
cha0s commented:
Have you tried the SimpleXML patch I posted?
Comment #7
Ruud commented:
I have the same problem with XML import in Excel, this time with the following characters: { ë ï ' ö é ä }.
After replacing these characters with { e i o e a }, the XML import works.
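Replacing the letters with plain ASCII loses the accents. A possible alternative, sketched here in Python under the assumption that the file can be pre-processed before import, is to write each non-ASCII character as an XML numeric character reference. The file then contains only ASCII bytes, but a conforming XML parser still yields the original letters. The helper name `to_ascii_xml` is made up for this example.

```python
def to_ascii_xml(text: str) -> str:
    """Replace every non-ASCII character with a numeric character reference.

    Hypothetical helper: the standard "xmlcharrefreplace" error handler
    turns a character such as "ë" into "&#235;" and leaves ASCII untouched.
    """
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


print(to_ascii_xml("Zoë"))  # prints Zo&#235;
```

Because XML markup itself is ASCII, this can be applied to the whole file content without touching the tags. Whether the importer in question resolves such references correctly is not confirmed in this thread.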