4
0

Тайлбар байхгүй

Jan Friesse c5d519302c totemudpu: Don't block local socketpair 4 жил өмнө
build-aux 0a17ad25a3 git-version-gen: Fail on UNKNOWN version 7 жил өмнө
common_lib efaa4eb6c3 common_lib: Remove trailing spaces in cs_strerror 5 жил өмнө
conf e1155afb2d corosync.aug: Add missing options 8 жил өмнө
cts 185bc5ba9f NSS_NoDB_Init: the parameter is reserved, must be NULL 7 жил өмнө
exec 5d625cefe8 totemudpu: Don't block local socketpair 4 жил өмнө
include 64010f5738 totem: Add cancel_hold_on_retransmit config option 4 жил өмнө
init 243a4de08e init: Use cpgtool instead of cfgtool 6 жил өмнө
lib 0bcf5eee39 cmap: Fix strncpy warning in cmap_iter_next 7 жил өмнө
man 64010f5738 totem: Add cancel_hold_on_retransmit config option 4 жил өмнө
pkgconfig 94be35249b pkgconfig: Add libqb dependency 6 жил өмнө
qdevices 3eb94907fa qnetd: sort by node_id when add new client 4 жил өмнө
test 8595a8768e tests: Use CS_DISPATCH_BLOCKING instead of cycle 5 жил өмнө
tools 4e61fb0560 cmapctl: check NULL for key type and value for -p 5 жил өмнө
.clang-format 06058e34cf Add clang-format configuration file 9 жил өмнө
.gitarchivever c928a61e1d build: Support for git archive stored tags 7 жил өмнө
.gitattributes c928a61e1d build: Support for git archive stored tags 7 жил өмнө
.gitignore c7ebb09530 cleanup after test-driver 11 жил өмнө
AUTHORS 20a5289074 drop evs service 14 жил өмнө
Doxyfile.in b252013e42 Remove deprecated doxygen flags 9 жил өмнө
INSTALL e8a5c56ab2 Doc: Enhance INSTALL file a bit 11 жил өмнө
LICENSE 8cdd2fc493 Remove libtomcrypt 14 жил өмнө
Makefile.am 7d3979fdba Qnetd: Execute qnetd as non root user 9 жил өмнө
README.recovery 62148d10cf Fix a typo in README.recovery 13 жил өмнө
SECURITY 25f6b0f236 SECURITY: be consistent on the hash algorithm used 7 жил өмнө
autobuild.sh 34e37f130f autobuild: make sure systemd is enabled on f15+ 14 жил өмнө
autogen.sh 76d18f964d build: use libtool for linking 13 жил өмнө
configure.ac 5c1b0266c9 configure.ac: fix pkgconfig issue of rdma 6 жил өмнө
corosync.spec.in f50fac8fd7 spec: Add explicit gcc build requirement 7 жил өмнө
loc aac70d1408 Remove services directory from loc command 14 жил өмнө

README.recovery

SYNCHRONIZATION ALGORITHM:
-------------------------
The synchronization algorithm is used for every service in corosync to
synchronize state of the system.

There are 4 events of the synchronization algorithm. These events are in fact
functions that are registered in the service handler data structure. They
are called by the synchronization system whenever a network partitions or
merges.

init:
Within the init event a service handler should record temporary state variables
used by the process event.

process:
The process event is responsible for executing synchronization. This event
will return a state as to whether it has completed or not. This allows for
synchronization to be interrupted and recontinue when the message queue buffer
is full. The process event will be called again by the synchronization service
if requesed to do so by the return variable returned in process.

abort:
The abort event occurs when during synchronization a processor failure occurs.

activate:
The activate event occurs when process has returned no more processing is
necessary for any node in the cluster and all messages originated by process
have completed.

CHECKPOINT SYNCHRONIZATION ALGORITHM:
------------------------------------
The purpose of the checkpoint syncrhonization algorithm is to synchronize
checkpoints after a paritition or merge of two or more partitions. The
secondary purpose of the algorithm is to determine the cluster-wide reference
count for every checkpoint.

Every cluster contains a group of checkpoints. Each checkpoint has a
checkpoint name and checkpoint number. The number is used to uniquely reference
an unlinked but still open checkpoint in the cluser.

Every checkpoint contains a reference count which is used to determine when
that checkpoint may be released. The algorithm rebuilds the reference count
information each time a partition or merge occurs.

local variables
my_sync_state may have the values SYNC_CHECKPOINT, SYNC_REFCOUNT
my_current_iteration_state contains any data used to iterate the checkpoints
and sections.
checkpoint data
refcount_set contains reference count for every node consisting of
number of opened connections to checkpoint and node identifier
refcount contains a summation of every reference count in the refcount_set

pseudocode executed by a processor when the syncrhonization service calls
the init event
call process_checkpoints_enter

pseudocode executed by a processor when the synchronization service calls
the process event in the SYNC_CHECKPOINT state
if lowest processor identifier of old ring in new ring
transmit checkpoints or sections starting from my_current_iteration_state
if all checkpoints and sections could be queued
call sync_refcounts_enter
else
record my_current_iteration_state

require process to continue

pseudocode executed by a processor when the synchronization service calls
the process event in the SYNC_REFCOUNT state
if lowest processor identifier of old ring in new ring
transmit checkpoint reference counts
if all checkpoint reference counts could be queued
require process to not continue
else
record my_current_iteration_state for checkpoint reference counts

sync_checkpoints_enter:
my_sync_state = SYNC_CHECKPOINT
my_current_iteration_state set to start of checkpont list

sync_refcounts_enter:
my_sync_state = SYNC_REFCOUNT

on event receipt of foreign ring id message
ignore message

pseudocode executed on event receipt of checkpoint update
if checkpoint exists in temporary storage
ignore message
else
create checkpoint
reset checkpoint refcount array

pseudocode executed on event receipt of checkpoint section update
if checkpoint section exists in temporary storage
ignore message
else
create checkpoint section

pseudocode executed on event receipt of reference count update
update temporary checkpoint data storage reference count set by adding
any reference counts in the temporary message set to those from the
event
update that checkpoint's reference count
set the global checkpoint id to the current checkpoint id + 1 if it
would increase the global checkpoint id

pseudocode called when the synchronization service calls the activate event:
for all checkpoints
free all previously committed checkpoints and sections
convert temporary checkpoints and sections to regular sections
copy my_saved_ring_id to my_old_ring_id

pseudocode called when the synchronization service calls the abort event:
free all temporary checkpoints and temporary sections