Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] be很脆弱,动不动就宕机 #13403

Open
3 tasks done
zhanghengdashuaibi opened this issue Oct 17, 2022 · 10 comments
Open
3 tasks done

[Feature] be很脆弱,动不动就宕机 #13403

zhanghengdashuaibi opened this issue Oct 17, 2022 · 10 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@zhanghengdashuaibi
Copy link

zhanghengdashuaibi commented Oct 17, 2022

Search before asking

  • I had searched in the issues and found no similar issues.

Description

doris1.1.2版本
算子任务 很普通的 mysqlcdc to doris 没有多大数据量 就是用来测试的,跑了一个星期 宕机5 6次
be很脆弱,动不动就宕机,太不稳定了?新下载的包部署后测试了两天,be宕机5 6次,还不知道是啥情况宕机,动不动就是rowset报错lib64的
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1017 09:30:42.672709 8381 env.cpp:46] Env init successfully.
*** Aborted at 1665971351 (unix time) try "date -d @1665971351" if you are using GNU date ***
*** SIGSEGV unkown detail explain (@0x0) received by PID 8381 (TID 0x7facd1d47700) from PID 0; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /mnt/disk2/apache-doris/be/src/common/signal_handler.h:420
1# 0x00007FAD2F82D400 in /lib64/libc.so.6
2# doris::BaseFieldtypeTraits<(doris::FieldType)9>::equal(void const*, void const*) at /mnt/disk2/apache-doris/be/src/olap/types.h:491
3# doris::TupleReader::_unique_key_next_row(doris::RowCursor*, doris::MemPool*, doris::ObjectPool*, bool*) at /mnt/disk2/apache-doris/be/src/olap/tuple_reader.cpp:197
4# doris::Merger::merge_rowsets(std::shared_ptrdoris::Tablet, doris::ReaderType, std::vector<std::shared_ptrdoris::RowsetReader, std::allocator<std::shared_ptrdoris::RowsetReader > > const&, doris::RowsetWriter*, doris::Merger::Statistics*) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
5# doris::Compaction::do_compaction_impl(long) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
6# doris::Compaction::do_compaction(long) at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:112
7# doris::CumulativeCompaction::execute_compact_impl() in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
8# doris::Compaction::execute_compact() at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:50
9# doris::Tablet::execute_compaction(doris::CompactionType) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
10# std::_Function_handler<void (), doris::StorageEngine::_submit_compaction_task(std::shared_ptrdoris::Tablet, doris::CompactionType)::$_12>::_M_invoke(std::_Any_data const&) at /mnt/disk2/ygl/installs/ldbtools/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
11# doris::ThreadPool::dispatch_thread() at /mnt/disk2/apache-doris/be/src/util/threadpool.cpp:578
12# doris::Thread::supervise_thread(void*) at /mnt/disk2/apache-doris/be/src/util/thread.cpp:407
13# start_thread in /lib64/libpthread.so.0
14# clone in /lib64/libc.so.6

Use case

No response

Related issues

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@zhanghengdashuaibi zhanghengdashuaibi added the kind/feature Categorizes issue or PR as related to a new feature. label Oct 17, 2022
@Userwhite
Copy link
Contributor

机型不支持某些指令吧..

@dataalive
Copy link
Contributor

Doris的版本是?你可以给更多的信息看看?这种反复挂的应该是某一个特殊的SQL或者什么触发了。

@morningman
Copy link
Contributor

这不是一个好的提问题的方式
@zhanghengdashuaibi

@zhanghengdashuaibi
Copy link
Author

Doris的版本是?你可以给更多的信息看看?这种反复挂的应该是某一个特殊的SQL或者什么触发了。

1.1.2的版本 flinkcdc同步mysql的数据到doris,没有多大数据量就是为了测试,跑了一个星期,宕机了5 6次

WARNING: Logging before InitGoogleLogging() is written to STDERR
I1017 09:30:42.672709 8381 env.cpp:46] Env init successfully.
*** Aborted at 1665971351 (unix time) try "date -d @1665971351" if you are using GNU date ***
*** SIGSEGV unkown detail explain (@0x0) received by PID 8381 (TID 0x7facd1d47700) from PID 0; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /mnt/disk2/apache-doris/be/src/common/signal_handler.h:420
1# 0x00007FAD2F82D400 in /lib64/libc.so.6
2# doris::BaseFieldtypeTraits<(doris::FieldType)9>::equal(void const*, void const*) at /mnt/disk2/apache-doris/be/src/olap/types.h:491
3# doris::TupleReader::_unique_key_next_row(doris::RowCursor*, doris::MemPool*, doris::ObjectPool*, bool*) at /mnt/disk2/apache-doris/be/src/olap/tuple_reader.cpp:197
4# doris::Merger::merge_rowsets(std::shared_ptrdoris::Tablet, doris::ReaderType, std::vector<std::shared_ptrdoris::RowsetReader, std::allocator<std::shared_ptrdoris::RowsetReader > > const&, doris::RowsetWriter*, doris::Merger::Statistics*) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
5# doris::Compaction::do_compaction_impl(long) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
6# doris::Compaction::do_compaction(long) at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:112
7# doris::CumulativeCompaction::execute_compact_impl() in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
8# doris::Compaction::execute_compact() at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:50
9# doris::Tablet::execute_compaction(doris::CompactionType) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
10# std::_Function_handler<void (), doris::StorageEngine::_submit_compaction_task(std::shared_ptrdoris::Tablet, doris::CompactionType)::$_12>::_M_invoke(std::_Any_data const&) at /mnt/disk2/ygl/installs/ldbtools/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
11# doris::ThreadPool::dispatch_thread() at /mnt/disk2/apache-doris/be/src/util/threadpool.cpp:578
12# doris::Thread::supervise_thread(void*) at /mnt/disk2/apache-doris/be/src/util/thread.cpp:407
13# start_thread in /lib64/libpthread.so.0
14# clone in /lib64/libc.so.6

@zhanghengdashuaibi
Copy link
Author

这不是一个好的提问题的方式 @zhanghengdashuaibi

抱歉抱歉 有点着急了

@zhanghengdashuaibi
Copy link
Author

机型不支持某些指令吧..
一个测试的算子任务跑了一个星期,没大多数据量,宕机5 6次 用的1.1.2版本的

WARNING: Logging before InitGoogleLogging() is written to STDERR
I1017 09:30:42.672709 8381 env.cpp:46] Env init successfully.
*** Aborted at 1665971351 (unix time) try "date -d @1665971351" if you are using GNU date ***
*** SIGSEGV unkown detail explain (@0x0) received by PID 8381 (TID 0x7facd1d47700) from PID 0; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /mnt/disk2/apache-doris/be/src/common/signal_handler.h:420
1# 0x00007FAD2F82D400 in /lib64/libc.so.6
2# doris::BaseFieldtypeTraits<(doris::FieldType)9>::equal(void const*, void const*) at /mnt/disk2/apache-doris/be/src/olap/types.h:491
3# doris::TupleReader::_unique_key_next_row(doris::RowCursor*, doris::MemPool*, doris::ObjectPool*, bool*) at /mnt/disk2/apache-doris/be/src/olap/tuple_reader.cpp:197
4# doris::Merger::merge_rowsets(std::shared_ptrdoris::Tablet, doris::ReaderType, std::vector<std::shared_ptrdoris::RowsetReader, std::allocator<std::shared_ptrdoris::RowsetReader > > const&, doris::RowsetWriter*, doris::Merger::Statistics*) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
5# doris::Compaction::do_compaction_impl(long) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
6# doris::Compaction::do_compaction(long) at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:112
7# doris::CumulativeCompaction::execute_compact_impl() in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
8# doris::Compaction::execute_compact() at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:50
9# doris::Tablet::execute_compaction(doris::CompactionType) in /opt/apache-doris-be-1.1.2-bin-x86_64/be/lib/doris_be
10# std::_Function_handler<void (), doris::StorageEngine::_submit_compaction_task(std::shared_ptrdoris::Tablet, doris::CompactionType)::$_12>::_M_invoke(std::_Any_data const&) at /mnt/disk2/ygl/installs/ldbtools/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
11# doris::ThreadPool::dispatch_thread() at /mnt/disk2/apache-doris/be/src/util/threadpool.cpp:578
12# doris::Thread::supervise_thread(void*) at /mnt/disk2/apache-doris/be/src/util/thread.cpp:407
13# start_thread in /lib64/libpthread.so.0
14# clone in /lib64/libc.so.6

@dataalive
Copy link
Contributor

我在 #13411 中已经回复,这个是一个在1.1.3中修复的bug,请升级下看看。

@SpringHuBiWei
Copy link

我的更难受,目前release的就是1.1.2 我就用cdc做简单的一比一 写入,写一两条就挂,还找不到错误信息 就是一直挂be 莫名其妙

@dataalive
Copy link
Contributor

@SpringHuBiWei 这个可以线下看看,这个可能是一个比较简单的问题,可以联系微信: warpdrive1987

@SpringHuBiWei
Copy link

@SpringHuBiWei 这个可以线下看看,这个可能是一个比较简单的问题,可以联系微信: warpdrive1987
hi,大佬,这个我看1.2.3已经release 我今天更新了这个版本之后 be现在是没问题了,目前观察还行 还在测试中,你这个微信号搜不到.....

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature.
Projects
None yet
Development

No branches or pull requests

5 participants