一个比较大的查询:涉及数据量为一个分区,大小在8GB左右。查询大概150秒后控制台报错:
ERROR 5024 (HY000) at line 2: errCode = 2, detailMessage = Execute timeout
请问下这是什么问题?
注:我这边确认关于timeout的相关变量,如quert_timeout等都已设为3600以上
附相关log
be.log
I0729 10:29:34.980911 51410 plan_fragment_executor.cpp:76] Prepare(): query_id=9f8124237ba346ce-b99fa48ee459e363 fragment_instance_id=9f8124237ba346ce-b99fa48ee459e383 backend_num=1
I0729 10:29:36.812634 51351 tablets_channel.cpp:59] open tablets channel: (id=9f8124237ba346ce-b99fa48ee459e363,index_id=6638906), tablets num: 470, timeout(s): 7200
W0729 10:32:07.060565 51339 internal_service.cpp:108] tablet writer add batch failed, message=fail to add batch in load channel. unknown load_id=9f8124237ba346ce-b99fa48ee459e363, id=9f8124237ba346ce-b99fa48ee459e363, index_id=6602315, sender_id=4
W0729 10:32:07.061234 51333 internal_service.cpp:108] tablet writer add batch failed, message=fail to add batch in load channel. unknown load_id=9f8124237ba346ce-b99fa48ee459e363, id=9f8124237ba346ce-b99fa48ee459e363, index_id=6602315, sender_id=7
W0729 10:32:07.151008 51389 tablet_sink.cpp:163] NodeChannel[6602315-5651558] add batch req success but status isn't ok, load_id=9f8124237ba346ce-b99fa48ee459e363, txn_id=147109600, node=10.197.34.155:8060, errmsg=fail to add batch in load channel. unknown load_id=9f8124237ba346ce-b99fa48ee459e363
fe.log(没有非info信息)
2021-07-29 10:32:10,210 INFO (thrift-server-pool-200|632) [QeProcessorImpl.reportExecStatus():109] ReportExecStatus(): fragment_instance_id=9f8124237ba346ce-b99fa48ee459e37b, query id=9f8124237ba346ce-b99fa48ee459e363, backend num: 43, ip: TNetworkAddress(hostname:10.197.141.145, port:27929)
是导入查询。「对端」是什么意思?
看起来你这是一个 导入查询?
tablet writer add batch failed, message=fail to add batch in load channel.
这种类似的错误需要到对端找一下