Backend active/active cluster #482

eea03 · 2015-07-28T15:14:17Z

UV can now run with 2 (or N in theory) backend processes, which run simultaneously, so executions are processed by two nodes.

each backend must run on its own server (app lock prevents from running multiple backends on one server)
backend cluster was developed and tested only with PostgreSQL. Should work also with MySQL but no tested and no guarantee
backend cluster is purely optional and must be explicitly turned on by configuration properties
by default UV runs in single mode - without any markable changes to previous versions
in cluster mode, frontend communicates with backends via database (checking if backend online) - in single mode, frontend communicates with backend directly via RMI

…ble used for backends exclusive locking; PipelineExecution now has backend_id parameter (=executing backend)

…ions (running executions are now failed, not restarted - configurable behavior, by default old behavior is preserved); Bug fixes; Adaptation of backend unit tests

… scheduler; Fixes in synchronization via DB table

…backends directly via RMI, but only checks timestamp in database

…vival Clear DPU selection after DPU was successfully removed.

…it script

…on, minor refactoring; Unit tests fixes

… turned on by configuration properties both for backend and frontend UV processes; By default, UV behaves exactly the same as before.

…tions in non-cluster mode

…pipelines that it can process; It does not allocate in advance preventing other backend from processing executions

…plate_revival" This reverts commit 11814e6, reversing changes made to 67e674c.

…mode is missing

tomas-knap · 2015-08-18T12:48:16Z

Info about testing the cluster (in SK):

testovane s 2000 QUEUED exekuciami naliatymi do systemu naraz
pomocou SQL skriptov bolo naraz do systemu nasypanych 2000 QUEUED executions
pocas testu bolo este viackrat pridavanych zopar QUEUED exekucii s IGNORE priority = musia sa pustit aj ked uz backend spracovava limit exekucii
testy ukazali, ze kazda exekucia bola spustena prave raz - dokazane cez pocty a statusy exekucii a logy (prislusne SQL commandy su v prilozenom skripte)
obidva backendy spracovali priblizne rovnake mnozstvo exekucii (pravdepodobne by bolo uplne rovnake ak by sa tam nepridavali IGNORE priority exekucie - pri mensich testoch mi vysli uplne rovnake pocty spracovanych exekucii)
testovane aj paralelne spracovanie schedules
naliatych niekolkokrat 10 schedules pre 10 roznych pipeline naraz do systemu - pre kazdy schedule bola vytvorena prave jedna pipeline execution (otestovane cez SQL commandy)
testovanie ukazalo, ze nie su ziadne concurrent issues ani pri vysokom loade

tomas-knap · 2015-08-18T13:45:28Z

Note: In case of cluster, shared directory should be used for DPU templates (JAR files), so that backends are automatically notified as any DPU template is changed. Otherwise, Maintenance of DPUs will be tougher - when certain DPU is imported/replaced via frontend, backends on other servers (using different directory than frontend) has to be updated manually - thus when new version of the DPU is prepared and loaded via UV admin interface, it has to be also manually deployed to backends.

tomas-knap · 2015-08-18T13:48:23Z

Note: Cluster of backends will not work properly in case of DPUs relying on various caches unless the directory with caches is a shared directory that all backends can access.

tomas-knap · 2015-08-18T14:08:36Z

Please unify the way how frontend checks whether backend is online. So even in case of single backend, RMI is not used for checking whether backend is online, but DB is checked instead. (as in cluster mode)

Expected changes for frontend:

No need to define whether frontend works in single/cluster mode using backend.cluster.mode
status update whether backend is online will not be instant, but will have certain delay based on the settings of the param backend.alive.limit (I suggest by default 10s)
when pipeline is run/debug from frontend, it may take up to 2s before certain backend really runs the pipeline

Expected changes for backend:

backend.id is mandatory also in cluster mode
backend.cluster.mode property not needed

This update may be also solved in a separate pull request (if not doable before vacation)

…ld be enough as backend should update its status timestamp each 2 seconds)

…backendCluster

tomas-knap · 2015-08-18T18:38:47Z

I tested single mode, which works fine.

But when I tried cluster mode with mysql I got the following problem:
#509

In #509, there is a suggested solution. Please test whether the suggested solution works. If not and cannot be figured out quickly, we can solve that in a separate pull request.

Otherwise approved.

… MySQL

eea03 · 2015-08-19T12:31:56Z

I fixed the issue with MySQL as suggested.
This time I executed full proof tests both for MySQL and PostgreSQL.

Test description:

2 backend servers (Win 10 + Virtual Ubuntu) - Postgres 9.4.4 / MySQL 5.6.25
200 QUEUED executions of the same pipeline inserted at once at the beginning of the test
3x during test inserted 10 new IGNORE priority QUEUED executions

Test result

for both databases, test was successful
executions were distributed more or less evenly
each execution was started and processed only by one backend
IGNORE priority executions were processed instantly even if limit was exceeded

Everything should be OK now

Backend active/active cluster

eea03 added 4 commits July 24, 2015 15:37

Preparation for backend cluster - active / passive mode; Added new ta…

606aea2

…ble used for backends exclusive locking; PipelineExecution now has backend_id parameter (=executing backend)

Cluster changes implementation; Changed sanitizing of pipeline execut…

bd1539b

…ions (running executions are now failed, not restarted - configurable behavior, by default old behavior is preserved); Bug fixes; Adaptation of backend unit tests

Backend cluster implementation continued; Fixes of initial startup of…

f33237b

… scheduler; Fixes in synchronization via DB table

Merge branch 'develop' into feature/backendCluster

79fe3c1

eea03 added severity: enhancement priority: High status: in progress labels Jul 28, 2015

eea03 self-assigned this Jul 28, 2015

eea03 and others added 9 commits July 29, 2015 15:47

Backend cluster implementation; Frontend no longer communicates with …

a33e116

…backends directly via RMI, but only checks timestamp in database

Bugfix: Backend offline warning shown when none of backends are online

43c7fcf

Merge pull request #483 from UnifiedViews/feature/409_dpu_template_re…

11814e6

…vival Clear DPU selection after DPU was successfully removed.

Added MySQL SQL scripts and Postgres update script; Fixed Postgres in…

2cfa4b1

…it script

First changes towards active/active backend cluster

3bfba6f

Active / active backend UV cluster implementation; Fixes, documentati…

12fd528

…on, minor refactoring; Unit tests fixes

Merge branch 'develop' into feature/backendCluster

73c28fc

UV Backend active/active cluster; Cluster is now optional and must be…

44143e9

… turned on by configuration properties both for backend and frontend UV processes; By default, UV behaves exactly the same as before.

Fix of non-cluster backend executor - backend ID is not set for execu…

37de9ad

…tions in non-cluster mode

eea03 changed the title ~~Backend active/passive cluster~~ Backend active/active cluster Aug 4, 2015

eea03 and others added 4 commits August 4, 2015 17:59

Reverted changes in backend test configuration for cluster unit tests

259eb1d

UV backend cluster enhancement; Backend allocates only such count of …

a44dd04

…pipelines that it can process; It does not allocate in advance preventing other backend from processing executions

Merge branch 'develop' into feature/backendCluster

43eba34

Revert "Merge pull request #483 from UnifiedViews/feature/409_dpu_tem…

9711181

…plate_revival" This reverts commit 11814e6, reversing changes made to 67e674c.

skrchnavy added this to the Release v2.2.0 milestone Aug 6, 2015

eea03 and others added 7 commits August 10, 2015 10:32

Merge branch 'develop' into feature/backendCluster

26e0116

Merge branch 'develop' into feature/backendCluster

79afa26

Merge branch 'develop' into feature/backendCluster

0f615ee

Merge branch 'release/UV_Core_v2.1.3'

df0db38

Updated update script to drop execution view before recreating

501fa12

Updated comment of the public method

f9638ae

Adding logging of situation when config.properties param for cluster …

2d940e2

…mode is missing

Updated comment - returned value

3387dfa

Merge branch 'develop' into feature/backendCluster

a189567

eea03 and others added 4 commits August 18, 2015 16:21

Default limit for backend active status decreased to 10 seconds (shou…

b5470db

…ld be enough as backend should update its status timestamp each 2 seconds)

Merge branch 'develop', remote-tracking branch 'origin' into feature/…

3a73419

…backendCluster

Merge branch 'develop' into feature/backendCluster

fb6f469

Added license header for new files

bfd010d

Bugfix: adjusted allocation SQL query to work for both PostgreSQL and…

3b63d45

… MySQL

eea03 added status: ready for test and removed status: in progress labels Aug 19, 2015

eea03 assigned tomas-knap and unassigned eea03 Aug 19, 2015

tomas-knap mentioned this pull request Aug 21, 2015

Unify the way how frontend checks whether backend is online in case of single vs. cluster mode #514

Closed

tomas-knap added a commit that referenced this pull request Aug 21, 2015

Merge pull request #482 from UnifiedViews/feature/backendCluster

c5820f5

Backend active/active cluster

tomas-knap merged commit c5820f5 into develop Aug 21, 2015

tomas-knap removed the status: ready for test label Aug 21, 2015

tomas-knap deleted the feature/backendCluster branch August 21, 2015 08:36

tomas-knap added resolution: fixed status: resolved labels Aug 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backend active/active cluster #482

Backend active/active cluster #482

eea03 commented Jul 28, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

eea03 commented Aug 19, 2015

Backend active/active cluster #482

Backend active/active cluster #482

Conversation

eea03 commented Jul 28, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

tomas-knap commented Aug 18, 2015

eea03 commented Aug 19, 2015