List: Cluster
From: Brancaleoni Matteo
Date: June 20 2004 7:54am
Subject: DB node hang on start
Hi,

I cannot figure out what's happening on my test
cluster setup. Until now, a single machine
with the mgm, 1 db and 1 api node has worked fine.
Now I want to add a db node on another machine, so
I changed the mgm config.ini and added the proper tcp
connections. The DB node on the same
machine as the mgm starts ok;
the other DB node connects to the mgm node and
gets the config, but then stops at:

[root@bestia ndb_data1]# /usr/local/mysql/libexec/ndbd
2004-06-20 09:32:50 [NDB] INFO     -- Angel pid: 5895 ndb pid: 5896
2004-06-20 09:32:50 [NDB] INFO     -- NDB Cluster -- DB node 3
2004-06-20 09:32:50 [NDB] INFO     -- Version 3.5.0 (beta) --
2004-06-20 09:32:50 [NDB] INFO     -- Start initiated (version 3.5.0)

and on the mgm console I see only DB node #2 as started
(the one that's on the same machine as the mgm node), while
node #3 shows 'No contact':

NDB> all status
Node 2: Started (Version 3.5.0)
 
Node 3: No contact

with node #2:
2004-06-20 16:49:34 [NDB] INFO     -- Angel pid: 5558 ndb pid: 5560
2004-06-20 16:49:34 [NDB] INFO     -- NDB Cluster -- DB node 2
2004-06-20 16:49:34 [NDB] INFO     -- Version 3.5.0 (beta) --
2004-06-20 16:49:34 [NDB] INFO     -- Start initiated (version 3.5.0)
Dbdict: name=sys/def/SYSTAB_0,id=0
Dbdict: name=sys/def/NDB$EVENTS_0,id=2
Dbdict: name=test/def/matteotabella2,id=4
Dbdict: name=test/def/4/PRIMARY,id=6
Dbdict: name=test/def/matteo,id=8
Dbdict: name=test/def/8/PRIMARY,id=10
Dbdict: name=test/def/mytabella,id=12
Dbdict: name=test/def/12/PRIMARY,id=14
2004-06-20 16:50:12 [NDB] INFO     -- Started (version 3.5.0)

and node #3 sticks at
2004-06-20 09:52:11 [NDB] INFO     -- Angel pid: 10128 ndb pid: 10129
2004-06-20 09:52:11 [NDB] INFO     -- NDB Cluster -- DB node 3
2004-06-20 09:52:11 [NDB] INFO     -- Version 3.5.0 (beta) --
2004-06-20 09:52:11 [NDB] INFO     -- Start initiated (version 3.5.0)

I can wait a long time, but even with tcpdump
I cannot see any traffic flowing between computer #2 and computer #1,
except for the initial connection to the mgm on port 2200.
Also, the CPU is at 0%.

Any hint on where I can look?
Is there a way to enable more logging on the DB node at startup?

The hosts are on a local LAN, so I can ping between them by
hostname or IP address, and no firewall is involved.
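
Ping only exercises ICMP, so a check at the TCP level between the two
machines may be more telling. Below is a minimal sketch in Python. The
helper names resolve and can_connect are hypothetical, and the host name
and port in the usage comment are placeholders: only port 2200 (the mgm
port) appears in this report, and the ports used between the DB nodes
would have to be taken from the tcp entries in config.ini.

```python
import socket


def resolve(host):
    """Return the sorted IPv4 addresses that `host` resolves to on this machine."""
    _name, _aliases, addresses = socket.gethostbyname_ex(host)
    return sorted(addresses)


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


# Example usage (placeholder host name, run on each machine against the other):
#   print(resolve("other-host"))            # a loopback address here would be suspicious
#   print(can_connect("other-host", 2200))  # mgm port taken from the report above
```

Running this in both directions would show whether each machine resolves
the other's hostname to a LAN address and whether a plain TCP connection
can be opened on the configured ports, which is something ping cannot confirm.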

The installed source version is the same on both; the only difference
is that one machine runs kernel 2.4.26 and the other (the mgm node) 2.6.5.

matteo.


-- 
Brancaleoni Matteo <mbrancaleoni@stripped>
Espia Srl
