No matter what I try, importing content with Parse HTML using XPath just won't work when the importer is attached to a node type. The standalone form works every now and again.
I've tried 7.x-1.0-beta4 and the problem is the same as with dev. I keep getting errors similar to these:
htmlParseEntityRef: expecting ';' on line 38. Error code: 23
htmlParseEntityRef: no name on line 64. Error code: 68
htmlParseStartTag: invalid element name on line 419. Error code: 68
Unexpected end tag : span on line 419. Error code: 76

I'd be happy to share my importer here for a closer look, but it's scraping from an adult site, so... not so sure how Drupal would feel about that...

Can anyone please help? I've been pulling my hair out over this all day.

Comments

twistor’s picture

Category: Bug report » Support request
Status: Active » Fixed

Those errors aren't affecting anything. "Could not retrieve title from feed." means that Feeds could not automatically set the node title from the feed. That just means you have to manually set the node title.
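For anyone who still wants to track those warnings down: the htmlParseEntityRef messages come from libxml2, and they typically show up when the page contains an "&" that does not start a well-formed entity reference (an unescaped "&" in a URL query string is a common cause). Below is a rough Python sketch for locating such spots in a saved copy of the page. The function name, the regular expression, and the sample markup are all made up for illustration; the pattern is only a heuristic and is not how libxml2 or Feeds actually parses the page.

```python
import re

# Heuristic (an assumption, not libxml2's real logic): an "&" that is NOT the
# start of a well-formed reference such as &amp; or &#38; or &#x26;
BARE_AMP = re.compile(r"&(?!(?:[A-Za-z][A-Za-z0-9]*|#[0-9]+|#[xX][0-9A-Fa-f]+);)")


def find_bare_ampersands(html_text):
    """Return (line, column) pairs, both 1-based, for each suspicious '&'."""
    hits = []
    for lineno, line in enumerate(html_text.splitlines(), start=1):
        for match in BARE_AMP.finditer(line):
            hits.append((lineno, match.start() + 1))
    return hits


# Hypothetical sample: line 1 is fine, line 2 has two unescaped ampersands.
SAMPLE = '<p>Fish &amp; chips</p>\n<a href="/page?a=1&b=2">Tom & Jerry</a>\n'

hits = find_bare_ampersands(SAMPLE)
print(hits)
```

The line numbers it reports can be compared against the "on line 38" style positions in the warnings, with the caveat that the page may have changed between fetches.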

MrPeanut’s picture

I'm not going to reopen this, because it's just my ignorance, but why can't Feeds automatically set the node title?

When I turn on debugging, the xpathparser:1 output does return the title that I'm trying to use. It just won't set the title for some reason.

twistor’s picture

The title, in this case, is the title of the feed. Generic XML doesn't have a predictable title field. The same applies to CSV. It works for RSS/Atom because they usually have a title field.
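To illustrate the difference, here is a small Python sketch using only the standard library. It assumes RSS 2.0, where the feed title lives at the fixed path rss/channel/title; the sample documents and the feed_title helper are invented for this example and have nothing to do with how Feeds is implemented internally.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample documents, for illustration only.
RSS = """<rss version="2.0"><channel><title>Example feed</title>
<item><title>First item</title></item></channel></rss>"""

GENERIC = """<catalog><product><name>Widget</name></product></catalog>"""


def feed_title(xml_text):
    """Return the feed-level title if the document is RSS 2.0, else None."""
    root = ET.fromstring(xml_text)
    # RSS 2.0 keeps the feed title at a predictable place: rss/channel/title.
    # Arbitrary XML has no such convention, so there is nothing to look up.
    node = root.find("channel/title") if root.tag == "rss" else None
    return node.text if node is not None else None


rss_title = feed_title(RSS)
generic_title = feed_title(GENERIC)
print(rss_title)      # prints: Example feed
print(generic_title)  # prints: None
```

Note that the title of the first item is a different thing from the title of the feed itself; only the latter is what gets looked up here.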

MrPeanut’s picture

"Generic XML doesn't have a predictable title field."

I'm using an XPath Expression to declare my title field. Shouldn't that be able to set the node title then? When I turn on debugging, it's showing the correct value. (Maybe I should open a separate issue.)

twistor’s picture

You're confusing the title of the feed node with the title of the nodes being created.

MrPeanut’s picture

So is it not possible to set the title of the node using Feeds Self Node Processor and an XPath Expression (HTML) with the importer attached to the content type?

The use case is: create a new node, fill out some of the fields, enter the feed URL (which is actually an HTML page), and have Feeds and Feeds Self Node Processor scrape the page to get the remaining fields. It works for everything except the title field.

twistor’s picture

That would be a bug in Self Node Processor and is unrelated to this. But, it should be possible.

MrPeanut’s picture

Okay, thanks. I did create an issue in that queue: #2204391: Set node title from feed (for anyone who may stumble across this later).

Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.