Hey there, I'm using the SemanticProxy on a test site and it workds pretty well!!

I still get this error once in a while :

"SemanticProxy processing error: Internal Exception when trying to use Calais WS (RemoteException)) - Text length has exceeded the allowed size ."

Any ideas?

thanks,
Patchak

Support from Acquia helps fund testing for Drupal Acquia logo

Comments

febbraro’s picture

Hello @patchak thanks for the report.

That is an internal SemanticProxy error, not sure specifically what it means (besides the text was too long) :p

We can try submitting this to the SemanticProxy team at http://www.opencalais.com/forum/213 to see what they say. Are you able to reliably reproduce the error with a specific URL? Or will sometimes the same URL submit fine and other times fail?

Thanks again.

patchak’s picture

It seems this has been fixed... http://www.opencalais.com/forums/semanticproxy/text-too-long-error

Thanks for the tip!

febbraro’s picture

Status: Active » Fixed

Great news.

Cleaning up.

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.

mikeytown2’s picture

Version: 6.x-3.1 » 6.x-3.2
Status: Closed (fixed) » Active

I'm trying to do it with a document that has almost a megabyte worth of text; at a minimum can this submit up to the max & truncate the extra, and store the truncated nid's in a table, so I know which ones it failed on.

mikeytown2’s picture

Version: 6.x-3.2 » 6.x-3.x-dev
Component: SemanticProxy » Miscellaneous
Status: Active » Needs review
FileSize
11.13 KB

Better yet, this splits up the doc and merges the objects... at least I think that's what it's doing. As a bonus?, my editor is removing CRLF's and making them into LF's

mikeytown2’s picture

Title: Semantic Proxy error » Allow for large documents to be indexed
mikeytown2’s picture

FileSize
12.67 KB

Above patch only gets the last one; had to go ghetto... here's a patch that works, but is much slower. Oh and it's removing extra whitespace.

mikeytown2’s picture

Status: Needs review » Active

nope, this doesn't work either...

mikeytown2’s picture

Modifying the first patch so it works would be ideal, but I don't know the $keywords object structure, or if that is the right way to do this. Also one would have to get the average of the relevance to do it correctly. If Calais didn't overwrite the data, then my second way might work.

mikeytown2’s picture

Status: Active » Needs work
FileSize
10.94 KB

This is what I'm going to use for the time being.

febbraro’s picture

@miketown2

Thanks for this. I like the idea, however your last patch does not have all of the code though, it has something for the breakup, but not the individual submits. Can you try rerolling it so that I can see if it will apply cleanly?

mikeytown2’s picture

Status: Needs work » Needs review
FileSize
11.06 KB

I only submit the first part since merging the data was a pain, due to it being stored in an object.

ElusiveMind’s picture

Issue summary: View changes

Semantic proxy is no longer offered and has been removed from Open Calais.

ElusiveMind’s picture

Status: Needs review » Closed (won't fix)