Load balancing implementation [#267831]

When Solr is being run in a replicated environment, writes (indexing) needs to go to a master server, where as searches (reads) can go to any of the slaves which are generally configured with a load balancer / round robin DNS.

So, there should be the option to use different instance information for indexing and for searching.

I'm going to take this forward.

Comment	File	Size	Author
#40	apachesolr-balacing-delayed-better.patch	32.2 KB	pounard
#39	apachesolr-balacing-delayed.patch	27.83 KB	pounard
#38	apachesolr-balancer-267831-D6-exported-availability-checks-RIGHT.patch	180.3 KB	pounard
#37	apachesolr-balancer-267831-D6-exported-availability-checks.patch	24 KB	pounard
#29	apachesolr-balancer-267831-D6.patch	26.45 KB	claudiu.cristea
#28	apachesolr-multiservers-267831-5-D5.patch	25.67 KB	claudiu.cristea
#27	apachesolr-multiservers-267831-4-D5.patch	30.55 KB	claudiu.cristea
#27	apachesolr-balancer-1.png	58.54 KB	claudiu.cristea
#27	apachesolr-balancer-2.png	60.99 KB	claudiu.cristea
#27	apachesolr-balancer-3.png	26.4 KB	claudiu.cristea
#16	apachesolr_multiservers_1-267831.patch	20.97 KB	claudiu.cristea
#12	apachesolr_multiservers-267831.patch	21.15 KB	claudiu.cristea
#12	Screenshot.png	24.11 KB	claudiu.cristea
#12	Screenshot-1.png	65.48 KB	claudiu.cristea
#12	Screenshot-2.png	62.85 KB	claudiu.cristea
#2	solr_multi_instance.diff	14.66 KB	JacobSingh

Comments

Comment #1

robertdouglass commented 13 June 2008 at 06:46

Start looking in the apachesolr/SolrPhpClient/Apache/Solr/Service/Balancer.php file. I am also in the process of updating this whole package to the next version which you can find here: https://issues.apache.org/jira/browse/SOLR-341

Comment #2

JacobSingh commented 13 June 2008 at 08:18

Status:

Active

» Needs review

Status	File	Size
new	solr_multi_instance.diff	14.66 KB

D'oh, I might have done this differently...

Anyway, here is a patch which allows for multiple instances of slave / master pairs. Please take a look. I know it needs some work, but if you are already working on this, it would be good to know.

Comment #3

robertdouglass commented 7 September 2008 at 19:15

Not sure of this for 1.0. Jacob, can you make the argument that we absolutely need this for 1.0?

Comment #4

anarchivist commented 19 June 2009 at 22:56

Version:

5.x-1.x-dev

» 6.x-1.x-dev

This seems somewhat abandoned, and there seems like there's relatively recent interest given #434314: Load balance search queries. I may look into rerolling JacobSingh's patch for 6.x-1.x-dev this weekend.

Comment #5

robertdouglass commented 20 June 2009 at 08:58

@anarchivist I think there'd be interest in a revived patch.

Comment #6

Scott Reynolds commented 21 June 2009 at 23:14

I second that comment. After talking at length with how to scale out our Solr instance, I would like to have one master Solr instance just for the indexing.

Comment #7

robertdouglass commented 17 July 2009 at 10:48

Version:

6.x-1.x-dev

» 6.x-2.x-dev

I'd love it if you could run with this, Scott.

Comment #8

anarchivist commented 21 December 2009 at 22:20

Wow, this issue has sat for a while! :) We ended up using a load balanced setup, with a proxy in front of the load balancer to point update requests to a separate Solr server. I'm inclined to think that this would be the best way to do this, and we might be better served with documentation about how to get this set up...

Comment #9

robertdouglass commented 21 December 2009 at 22:53

@anarchivist - any tips from your configuration can be included in the documentation. Would love to hear about how you did it.

Comment #10

anarchivist commented 22 December 2009 at 13:57

Sure thing. I'll do my best to work on it, as I'm supposed to be documenting it at work, too. :)

Comment #11

robertdouglass commented 26 December 2009 at 12:58

Status:

Needs review

» Needs work

Comment #12

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 17:34

Status:

Needs review

» Needs work

Status	File	Size
new	Screenshot-2.png	62.85 KB
new	Screenshot-1.png	65.48 KB
new	Screenshot.png	24.11 KB
new	apachesolr_multiservers-267831.patch	21.15 KB

I need this functionality too...

Here's a first attempt to implement multi Solr servers against DRUPAL-5--2. Right now this patch provides only the ability to allow different servers for querying and indexing. The "load balancing" feature is implemented only in terms of defining members in the load balancer.

A new tab (Solr Servers) is added to ApacheSolr settings page. See the first image. That page provides tools for adding/editing/deleting Solr servers. Also it provides the ability to configure which server is the indexer and which is part of the load balancer.

Any feed-back is welcomed.

Screenshots:

admin/settings/apachesolr - New tab
admin/settings/apachesolr/servers/0 - Editing a server
admin/settings/apachesolr/servers - Adding a new server

Comment #13

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 17:33

Version:	6.x-2.x-dev	» 5.x-2.x-dev
Status:	Needs work	» Needs review

Comment #14

Scott Reynolds commented 19 January 2010 at 17:49

Assigned:

JacobSingh

» Unassigned

So very cool. All the Form API stuff I haven't reviewed, and thats a big part of this patch. Unfort, I don't have a D5 site I maintain so my review is just a read through of the code.

So as I understand this patch, I can add load balancers, query only and indexers. But as I understand the changes to apachesolr_get_solr(), if I ask for 'query' I will only get back the first load balancer. Doesn't matter how many load balancer's I have, I always get the first one. Same is true for 'indexer'.

So I think we need a round robin system for this. And the only way to really move variables across multiple sessions is variable_gets/sets. And variable_gets/sets are cached so not sure how effective that will be.

Also, why bother using the key for the load balancer as "balancer" and the indexer server as "indexer". Seems to me, the only place those variables are used is apachesolr_get_solr(), so might as well have them line up with the $service_type. Makes the code easier to read and when doing a dump of the variable table, it will make sense ('balancer' == 'query' ? thats confusing).

Comment #15

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 18:12

Thanks for your review...

As I state in my comment the "load balancer" feature is not implemented. I only want to open the door to load balancing by defining multiple servers that will took part in the load balancer. In order to select the "query" server (which is unique right now) I'm picking up the first server that is "load balancer member". This is a temporary solution until we will learn to "balance"...

For the index server the things are different. Only one server can be "index server". So the first one will be always the single one...

Yes, renaming 'balancer' to 'query' can add more clarity to the code

Comment #16

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 20:37

Assigned:

Unassigned

» JacobSingh

Status	File	Size
new	apachesolr_multiservers_1-267831.patch	20.97 KB

Changed:

"balancer" => "query"
"indexer" => "index"

It seems less confusing to me too...

Comment #17

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 20:38

Assigned:

JacobSingh

» Unassigned

Comment #18

claudiu.cristea

Romanian

Arad 🇷🇴

commented 19 January 2010 at 21:01

Status:

Needs work

» Needs review

Comment #19

claudiu.cristea

Romanian

Arad 🇷🇴

commented 20 January 2010 at 15:56

@Scott Reynolds:

So I think we need a round robin system for this. And the only way to really move variables across multiple sessions is variable_gets/sets. And variable_gets/sets are cached so not sure how effective that will be.

There is a file, in the Solr PHP client, SolrPhpClient/Apache/Solr/Service/Balancer.php that it seems to do the job... The problem here is that this file defines a class Apache_Solr_Service_Balancer which uses Solr services of type Apache_Solr_Service (defined in SolrPhpClient/Apache/Solr/Service.php) while we are using our own class Drupal_Apache_Solr_Service which extends Apache_Solr_Service.

I don't have a clear picture right now about the reasons for that extension...

Looking in the code, I found there a real load balancing based on ping timeouts and not a simple rotation. The class is trying to find if a server is heavily loaded before deciding to use it or not...

I'm not an "expert" in Solr PHP client but a way to deal with this is to create a class that extends the Balancer class and to rewrite only methods that are referring to Apache_Solr_Service, replacing him with his successor Drupal_Apache_Solr_Service.

Any thoughts?

Comment #20

Scott Reynolds commented 20 January 2010 at 16:54

Right so Apache_Solr_Service_Balancer wraps the two arrays of writable and readable Solr objects. Those Solr objects can be the Drupal Solr objects just fine. I seem to remember though that changing the code to just use this Class wasn't quite equivalent to what we are doing. But looking at it now, it doesn't stand out. Would be interested to see what happens when we try to replace the Service implementation with the Balancer implementation.

Looks like the "interface" for interacting with the object is equivalent.

Comment #21

claudiu.cristea

Romanian

Arad 🇷🇴

commented 20 January 2010 at 18:02

Yes. The "interface" that is applicable to a Balancer is the same. For example the public methods: add(), addDocument(), addDocuments(), commit(), delete(), deleteById(), deleteByQuery(), optimize(), search() are called in the same way, with the same argument lists... So, replacing the Solr object with a Solr Balancer object should work.

There are also other methods that need recoding. Just an example. In apachesolr_requirements() we are pinging the server to see if is up.

      $solr = apachesolr_get_solr();
      $ping = @$solr->ping(variable_get('apachesolr_ping_timeout', 4));

I cannot see any method or variable to access a specific server through the Balancer object... In this case we will have to build the object as Apache_Solr_Service and use the ping() method...

You're right, I think, we should:

Build the list of writables & redables as Drupal_Apache_Solr_Service type
Cache them for later use in the request
Create the Balancer object using the previous lists
Use Balancer or Service, by case

Comment #22

Scott Reynolds commented 20 January 2010 at 18:18

Well a majority of the 'pinging' happens just prior to executing a command. I believe the Balancer.php handles that as well in its code. But to your point, hook_requirements would have to be rewritten for this anyway, as we would like to loop through all servers.

So i propose we extend the Balancer class and add our 'ping' method (I might call it something else like ping_all_servers). Which is missing from the existing patch btw, checking indexing and query servers to make sure they are up.

Comment #23

claudiu.cristea

Romanian

Arad 🇷🇴

commented 20 January 2010 at 18:39

Well, it's not only ping(). We have also: clearCache(), getLuke(), getStatsSummary(), getFields(), deleteMultipleById()... These are on a first look...

Comment #24

claudiu.cristea

Romanian

Arad 🇷🇴

commented 21 January 2010 at 08:17

I'm OK with extending the Balancer object.

In order to allow control to the balancer but also to a specific server I'm thinking to refactor the apachesolr_get_solr() object in this way:

No arguments

// Returns the Apache_Solr_Service_Balancer object
$solr = apachesolr_get_solr();

Numeric argument

// Returns the Drupal_Apache_Solr_Service object corresponding to that server ID
$solr = apachesolr_get_solr(2);

Keyed array argument with connection infos.

// Returns the Drupal_Apache_Solr_Service object corresponding to the server with those connection infos.
$solr = apachesolr_get_solr(array('host' => 'localhost', 'port' => '8983', 'path' => '/solr'));

String containing the connection URL. BTW: In my patch I forgot to do a validation for connection URL duplicates (you cannot add the same server twice!).

// Returns the Drupal_Apache_Solr_Service object corresponding to the server with this connection string.
$solr = apachesolr_get_solr('localhost:8983/solr');

This, together with extending Balancer (for multi-pings, etc) for accessing both, the Balancer and a single Service. Course we will microcache using static all those objects inside the function.

Any thoughts?

Comment #25

Scott Reynolds commented 21 January 2010 at 18:35

Im not a fan of a function that accepts multiple different argument types. So I would purpose.

$solr = apachesolr_get_balancer();

$solr = apachesolr_get_server($host, $port, $path)

I think that will make the code clear. But really other then my lil oppressive compulsiveness with function names, the plan makes sense.

Comment #26

claudiu.cristea

Romanian

Arad 🇷🇴

commented 21 January 2010 at 21:14

OK... No complain on this. But... apachesolr_get_server() should take also the server ID (delta) as argument... Don't have an example right now but I feel that we need that kind of flexibility....

I will try to create a patch on based on last comments... Then I will try to port it on 6.x-1.x-dev so that you can test it (porting will be in "blind" mode - I don't have a 6.x-1.x-dev installed!)

Comment #27

claudiu.cristea

Romanian

Arad 🇷🇴

commented 26 January 2010 at 18:53

Title:

Allow Solr to configure different hosts for indexing and searching

» Load balancing implementation

Status	File	Size
new	apachesolr-balancer-3.png	26.4 KB
new	apachesolr-balancer-2.png	60.99 KB
new	apachesolr-balancer-1.png	58.54 KB
new	apachesolr-multiservers-267831-4-D5.patch	30.55 KB

Voila! Here's a functional load balancing Apache Solr implementation based on SolrPhpClient/Apache/Solr/Service/Balancer.php. And you know what? It's working!

The patch is against DRUPAL-5--2. It would be nice if someone will try to port to a 6.x branch... I can do that but I don't have a 6.x installation so I will work as a blind man. And I don't want to do it unless someone really want to test it.

Screenshots:

TODO: I'm a little bit confused about how some functionality like ping(), getStatsSummary(), getFields(), getLuke(), will work on a load balancer. I've implemented this based on balancer "first came, first served". I think that this needs some discussions and dissemination.

Comment #28

claudiu.cristea

Romanian

Arad 🇷🇴

commented 27 January 2010 at 11:44

Status	File	Size
new	apachesolr-multiservers-267831-5-D5.patch	25.67 KB

Improving performance when a single server is used (no balancer). We don't need to load all Balancer API/object if we don't need it.

For a good abstraction I switched back to apachesolr_get_solr() (Sorry @Scott Reynolds). Now apachesolr_get_solr() returns a Drupal_Apache_Solr_Service_Balancer object if there are more than one server configured and a Drupal_Apache_Solr_Service object if we have only one server configured. All methods applicable to Drupal_Apache_Solr_Service should work also with Drupal_Apache_Solr_Service_Balancer in an abstract way. New/missed methods can be added to the class in the new file Drupal_Apache_Solr_Service_Balancer.php.

Comment #29

claudiu.cristea

Romanian

Arad 🇷🇴

commented 27 January 2010 at 16:26

Version:

5.x-2.x-dev

» 6.x-1.x-dev

Status	File	Size
new	apachesolr-balancer-267831-D6.patch	26.45 KB

Here's a NOT tested patch against DRUPAL-6--1. It may contain errors.

@Scott Reynolds, Can you test it?

Comment #30

Scott Reynolds commented 27 January 2010 at 17:06

Status:

Needs review

» Needs work

I will here soon I hope. Depends on how today goes and my motivation

But on first read

return $service->deleteByMultipleIds($ids, $fromPending = true, $fromCommitted = true, $timeout = 3600);

Eek! get rid of the default values.

And what is this pattern? What is the Exception class when code = 0 so we can use multiple catch statements. And comments should be above, start with a capital and end with a period.

   catch (Exception $e) {
        if ($e->getCode() != 0) { //IF NOT COMMUNICATION ERROR
          throw $e;
        }
      }

t('Server !host:!port/!path was saved as %name

Could be turned into one thing

t('Server !host_path was saved as %name

so those are my notes on first read. I will go through it and fix those.

Comment #31

claudiu.cristea

Romanian

Arad 🇷🇴

commented 27 January 2010 at 17:48

@Scott Reynolds

And what is this pattern? What is the Exception class when code = 0 so we can use multiple catch statements. And comments should be above, start with a capital and end with a period.
   catch (Exception $e) {
        if ($e->getCode() != 0) { //IF NOT COMMUNICATION ERROR
          throw $e;
        }
      }

This piece of code was inspired (in fact copied) from the Balancer.php code... This is the way used to "balance" between hosts. And I forgot there the comment as it was in the Balancer.php. The exception code check should be correct.

I agree for the rest of comments....