From: Brancaleoni Matteo Date: June 20 2004 7:54am Subject: DB node hang on start List-Archive: http://lists.mysql.com/cluster/13 Message-Id: <1087718088.2606.11.camel@athlon> MIME-Version: 1.0 Content-Type: text/plain Content-Transfer-Encoding: 7bit Hi, I cannot figure out what's happening on my test cluster setup. Till now a single machine with mgm, 1 db and 1 api works ok. Now I want to add a db node on another machine, so changed the mgm config.ini and added the proper tcp connections. the DB node on the same machine the mgm is starts ok, the other DB node connects to the mgm node, get the conf but then stops at : [root@bestia ndb_data1]# /usr/local/mysql/libexec/ndbd 2004-06-20 09:32:50 [NDB] INFO -- Angel pid: 5895 ndb pid: 5896 2004-06-20 09:32:50 [NDB] INFO -- NDB Cluster -- DB node 3 2004-06-20 09:32:50 [NDB] INFO -- Version 3.5.0 (beta) -- 2004-06-20 09:32:50 [NDB] INFO -- Start initiated (version 3.5.0) and on the mgm console I see only the DB node #2 as started (the one that's on the same machine ad mgm node) and the node #3 with 'no connection' : NDB> all status Node 2: Started (Version 3.5.0) Node 3: No contact with node #2: 2004-06-20 16:49:34 [NDB] INFO -- Angel pid: 5558 ndb pid: 5560 2004-06-20 16:49:34 [NDB] INFO -- NDB Cluster -- DB node 2 2004-06-20 16:49:34 [NDB] INFO -- Version 3.5.0 (beta) -- 2004-06-20 16:49:34 [NDB] INFO -- Start initiated (version 3.5.0) Dbdict: name=sys/def/SYSTAB_0,id=0 Dbdict: name=sys/def/NDB$EVENTS_0,id=2 Dbdict: name=test/def/matteotabella2,id=4 Dbdict: name=test/def/4/PRIMARY,id=6 Dbdict: name=test/def/matteo,id=8 Dbdict: name=test/def/8/PRIMARY,id=10 Dbdict: name=test/def/mytabella,id=12 Dbdict: name=test/def/12/PRIMARY,id=14 2004-06-20 16:50:12 [NDB] INFO -- Started (version 3.5.0) and node #3 stick at 2004-06-20 09:52:11 [NDB] INFO -- Angel pid: 10128 ndb pid: 10129 2004-06-20 09:52:11 [NDB] INFO -- NDB Cluster -- DB node 3 2004-06-20 09:52:11 [NDB] INFO -- Version 3.5.0 (beta) -- 2004-06-20 09:52:11 [NDB] INFO -- Start initiated (version 3.5.0) I can wait a lot of time... but also with tcpdump I cannot see any traffic flowing from/to computer#2 to computer#1, except for the initial connection to the mgm on port 2200 Also the cpu is at 0%. Any hint on where I can take a look? There's a way to enable more logging to the DB node at startup? the hosts are on a local lan, so I can ping between them with hostname or ipaddr and no firewall is involved. the source version installed is the same, the only difference is that one machine runs kernel 2.4.26, the other (the mgm node) 2.6.5 matteo. -- Brancaleoni Matteo Espia Srl