There is a feed full of images: http://art.gnome.org/backend.php

Its corresponding aggregator feed doesn't show any image, though: http://gnomedotorg.ourproject.org/index.php?q=aggregator/sources/7

Could you tell what is wrong here? Thank you.

Comments

Edward C. Zimmermann’s picture

That RSS feed is trash. Inlined images don't belong in the description field (and if one is going to throw in inlined HTML you need to use namesspaces).
There is in RSS an image field (one per item) and there is in RSS 2.0 the concept of an enclosure which can be anything including an image.

--
Edward C. Zimmermann <edz@nonmonotonic.net>
BSn's Nonmonotonic Lab

qgil’s picture

Hi Edward, thanks for your quick answer.

I wish I could talk about the quality of the code, but this falls beyond my knowledge.

All I know is that I'm a modest Drupal admin and I'm trying to have a valid RSS feed aggregated, that's all.

Apparently most rss feeders will parse the description field as html. Is Drupal's aggregator considering this capacity?

Edward C. Zimmermann’s picture

All I know is that I'm a modest Drupal admin and I'm trying to have a valid RSS feed aggregated, that's all.

The RSS feed validator is then itself defect following interpretation of RSS 2.0 to accept any non-namespaced markup as HTML in the description field. This is not only a widespread mistake (from lifting the restriction in RSS 0.9 to disallow anything but common entities) but if taken at face value is very dangerous and allows for wonderfull exploits (not just pop-up ads but also trojan and quite nasty tricks) and effectively means that one can't use RSS 2.0. A lot of vendors have a vested interest in the spread of this wrong turn and a lot of software can't do it right so its a bit out of control--- OK.. in Drupal if I set RSS 0.9 it still throws the kitchen sink into description and there its very clear that NO tags and NO entities (other than the few common ones) are allowed.
The point of the IMAGE and other fields is to move the stuff where it belongs..
Software that won't render that page I think are a good thing.. and only then will there perhaps be enough motivation for people to get on the path to do things right.. (and as a nice side-effect, search gets better).

Edward C. Zimmermann <edz@nonmonotonic.net>
BSn's Nonmonotonic Lab

sepeck’s picture

4.6.0/1/2/3 aggregator uses it's own HTML filter to filter incoming content. In 4.6.4/5 this filter was moved to filter module but there is no GUI tool to expose it to allow the user to configure allowed html tags. In 4.7 the filters for aggregator are user configurable through the admin interface. The default configuration for the filters is to protect your site.

If this is the result you want, then read here and then here for the updated fix.

They have a little article on how I found out how to make this work as I wanted it to that may help you in the future.

-sp
---------
Test site, always start with a test site.
Drupal Best Practices Guide -|- Black Mountain

-Steven Peck
---------
Test site, always start with a test site.
Drupal Best Practices Guide

Sam308’s picture

If I understand what you are asking, it is simple to add images to your incoming feeds.

To get the images to show up in your incoming RSS feeds, just add the <img> tag to your "Allowed HTML tags" settings under:

Home » Administer » Settings » aggregator

Sam Raheb (Sam308)