So I'm running this webportal, aggregating blogs from various sources. For most parts things are running smoothly but there are two issues that occur repeatedly and I rather need them to be fixed.
- Sporadic encoding problems.
All the blogs are in Icelandic and thus most of them encoded with iso-8859-1. As Drupal is using UTF-8 encoding there seems to be a clash every now and then (very irregularily), leading to the fact that all non-english characters in a given blog entry (á, æ, ö, ð, þ to name a few) are displayed in a weird character combination (á, æ, Ö, ð, þ to name a few).
This does not apply to all feeds at the same time, and not always the same feeds, but some are more at risk than others.
Most of the time this can be fixed by manually updating the feed, but that is not always the case.
It has been suggested to me that the iso-8859-1 encoded feeds are being stored in Drupal as utf-8 without being transcoded first, but I don't have a faintest idea how to solve this or where to solve it, for that matter.
Searching the drupal website suggests that there's a function named drupal_convert_to_utf8()
and to me it looks like a possible fix, but what I'm lacking is the skill to apply it to the problem.
Any suggestions?
- Blogger feed aggregation problems.
As with the other problem, I've run across that one around here somewhere, but here's the gist of it: