When importing nodes whose text fields contain the German letters Ä, Ö, Ü or ß, the imported node contains only the text that precedes the first such letter. The rest of the text field is not imported.

Comment   File                 Size      Author
#5        error.txt            3.22 KB   svenhs
#2        node_data.xml_.txt   6.74 KB   svenhs

Comments

Anonymous’s picture

Status: Active » Postponed (maintainer needs more info)

What type of file? Can you please attach a sample?

svenhs’s picture

File size: 6.74 KB

It happens, for example, when only stories are exported via XML. I don't know whether other content types are affected; I guess so. It may be a problem with the character encoding. All field mappings were left at their defaults. I've attached the file.

Anonymous’s picture

If you change <?xml version="1.0"?> in the file to <?xml version="1.0" encoding="UTF-8"?> before importing, does it help? It appears that the lack of the encoding attribute is the issue.
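One way to check whether the declaration is really the culprit is to test whether the file's bytes are valid UTF-8 at all. The following is only a minimal sketch in Python, not the importer's own code. The sample document and the helper names `is_valid_utf8` and `try_parse` are hypothetical, and it assumes the export was written as ISO-8859-1, which nothing in this thread confirms.

```python
import xml.etree.ElementTree as ET


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def try_parse(raw: bytes):
    """Return the <title> text, or None if the parser rejects the document."""
    try:
        return ET.fromstring(raw).findtext("title")
    except ET.ParseError:
        return None


# Hypothetical sample: "Müller" stored as ISO-8859-1, where ü is the single byte 0xFC.
latin1_body = "<node><title>Müller</title></node>".encode("iso-8859-1")

# Same bytes, two different declarations.
declared_utf8 = b'<?xml version="1.0" encoding="UTF-8"?>' + latin1_body
declared_latin1 = b'<?xml version="1.0" encoding="ISO-8859-1"?>' + latin1_body

print(is_valid_utf8(latin1_body))   # the bytes themselves are not UTF-8
print(try_parse(declared_utf8))     # a wrong declaration makes the parse fail
print(try_parse(declared_latin1))   # a matching declaration lets it succeed
```

If the bytes are not valid UTF-8, declaring encoding="UTF-8" describes the file incorrectly, so a parser may stop at the first umlaut instead of reading past it.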

Anonymous’s picture

Same issue here.
I added encoding="UTF-8", but for me all data import still stops after the first umlaut.
I tried it with users and roles.
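If adding the declaration alone changes nothing, the bytes in the file may simply not be UTF-8, in which case re-encoding the file before import could be worth a try. This is a rough sketch in Python under that assumption. The function name `to_utf8` and the default source encoding ISO-8859-1 are guesses for illustration, not something taken from the module or from the attached files.

```python
import re


def to_utf8(raw: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Re-encode an XML file as UTF-8 and give it a matching declaration.

    source_encoding is an assumption about how the export was written.
    """
    try:
        text = raw.decode("utf-8")           # already UTF-8: keep the text as it is
    except UnicodeDecodeError:
        text = raw.decode(source_encoding)   # fall back to the assumed legacy encoding

    # Drop any existing XML declaration so that it can be replaced.
    body = re.sub(r'^\s*<\?xml[^>]*\?>', '', text, count=1).lstrip()
    return ('<?xml version="1.0" encoding="UTF-8"?>\n' + body).encode("utf-8")


# Hypothetical sample with the letters from the original report.
sample = '<?xml version="1.0"?><n>Straße Ä Ö Ü</n>'.encode("iso-8859-1")
converted = to_utf8(sample)
print(converted.decode("utf-8"))
```

Running the function on a file that is already UTF-8 leaves its content unchanged, so it should be safe to apply to every export before importing.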

svenhs’s picture

FileSize
3.22 KB

Changing the declaration to <?xml version="1.0" encoding="UTF-8"?> doesn't help. However, I realised that my result isn't really representative. I think testing this with two identical installations, the second of them brand new, might give a completely different result. I tried to give my two installations the same settings, modules, etc., but it didn't help.

I attached my error messages; maybe they help. I'm using the German version of Drupal from drupalcenter.de. Despite these error messages, the import of the nodes works correctly, except for the ä ö ü ß as mentioned before. The duplicate entries don't seem to stop the import either. But you will probably get more out of the errors than I can.

cha0s’s picture

Have you tried the simpleXML patch I posted?

Ruud’s picture

I have the same problem with XML import in Excel, this time with the following characters: { ë ï ' ö é ä }.
After replacing these characters with { e i o e a }, the XML import works.
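For anyone who wants to apply the same workaround to a whole file, the replacement can be automated. This is only a sketch in Python, with a hypothetical helper name `strip_diacritics`. It is lossy: it removes the accents rather than fixing the encoding, and letters without a decomposition, such as ß, pass through unchanged.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Replace accented letters with their base letters (ë -> e, ä -> a)."""
    # NFKD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Keep everything except the combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_diacritics("ë ï ö é ä"))
```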