Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature](move-memtable) adapt sink v2 to pipelineX #27004

Closed
wants to merge 22 commits into from

Conversation

kaijchen
Copy link
Contributor

Proposed changes

Issue Number: close #xxx

Further comments

If this is a relatively large or complex change, kick off the discussion at [email protected] by explaining why you chose the solution you did and what alternatives you considered, etc...

@kaijchen
Copy link
Contributor Author

run buildall

1 similar comment
@kaijchen
Copy link
Contributor Author

run buildall

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

};

using RowsForTablet = std::unordered_map<int64_t, Rows>;
inline constexpr char VOLAP_TABLE_SINK_V2[] = "VOlapTableSinkV2";
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: do not declare C-style arrays, use std::array<> instead [modernize-avoid-c-arrays]

inline constexpr char VOLAP_TABLE_SINK_V2[] = "VOlapTableSinkV2";
                 ^

// specific language governing permissions and limitations
// under the License.

#include <brpc/uri.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: 'brpc/uri.h' file not found [clang-diagnostic-error]

#include <brpc/uri.h>
         ^

return static_cast<VTabletWriterV2*>(writer)->on_partitions_created(result);
}

Status VTabletWriterV2::_incremental_open_streams(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method '_incremental_open_streams' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status VTabletWriterV2::_incremental_open_streams(
static Status VTabletWriterV2::_incremental_open_streams(

return Status::OK();
}

Status VTabletWriterV2::_open_streams(int64_t src_id) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method '_open_streams' can be made static [readability-convert-member-functions-to-static]

be/src/vec/sink/writer/vtablet_writer_v2.h:126:

-     Status _open_streams(int64_t src_id);
+     static Status _open_streams(int64_t src_id);


Status VTabletWriterV2::_open_streams_to_backend(int64_t dst_id,
::doris::stream_load::LoadStreams& streams) {
auto node_info = _nodes_info->find_node(dst_id);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: 'auto node_info' can be declared as 'const auto *node_info' [readability-qualified-auto]

Suggested change
auto node_info = _nodes_info->find_node(dst_id);
const auto *node_info = _nodes_info->find_node(dst_id);

return Status::OK();
}

void VTabletWriterV2::_build_tablet_node_mapping() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method '_build_tablet_node_mapping' can be made static [readability-convert-member-functions-to-static]

be/src/vec/sink/writer/vtablet_writer_v2.h:132:

-     void _build_tablet_node_mapping();
+     static void _build_tablet_node_mapping();

// under the License.

#pragma once
#include <brpc/controller.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: 'brpc/controller.h' file not found [clang-diagnostic-error]

#include <brpc/controller.h>
         ^

#include <gen_cpp/types.pb.h>
#include <glog/logging.h>
#include <google/protobuf/stubs/callback.h>
#include <stddef.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: inclusion of deprecated C++ header 'stddef.h'; consider using 'cstddef' instead [modernize-deprecated-headers]

Suggested change
#include <stddef.h>
#include <cstddef>

#include <glog/logging.h>
#include <google/protobuf/stubs/callback.h>
#include <stddef.h>
#include <stdint.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: inclusion of deprecated C++ header 'stdint.h'; consider using 'cstdint' instead [modernize-deprecated-headers]

Suggested change
#include <stdint.h>
#include <cstdint>

@doris-robot
Copy link

TPC-H test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
Tpch sf100 test result on commit 9890da2582e016270147d82fb0699fe1c31454f0, data reload: false

run tpch-sf100 query with default conf and session variables
q1	5329	5184	5098	5098
q2	353	178	160	160
q3	2028	2005	1985	1985
q4	1388	1353	1344	1344
q5	3986	3953	4000	3953
q6	254	133	131	131
q7	1458	881	887	881
q8	2764	2774	2758	2758
q9	9719	9801	9519	9519
q10	3477	3519	3558	3519
q11	381	259	259	259
q12	424	285	284	284
q13	4558	4118	4131	4118
q14	320	293	276	276
q15	625	560	526	526
q16	679	579	589	579
q17	1129	1080	1081	1080
q18	8043	7558	7674	7558
q19	1652	1672	1683	1672
q20	526	293	309	293
q21	4651	4327	4340	4327
q22	500	392	421	392
Total cold run time: 54244 ms
Total hot run time: 50712 ms

run tpch-sf100 query with default conf and set session variable runtime_filter_mode=off
q1	5036	4988	5075	4988
q2	318	235	229	229
q3	4043	4006	4000	4000
q4	2778	2774	2760	2760
q5	9662	9589	9589	9589
q6	253	124	124	124
q7	3057	2521	2542	2521
q8	4850	4837	4847	4837
q9	13360	13090	13156	13090
q10	4093	4162	4186	4162
q11	749	657	675	657
q12	982	824	837	824
q13	4271	3861	3867	3861
q14	380	355	343	343
q15	587	525	550	525
q16	760	662	692	662
q17	3922	3821	3882	3821
q18	9656	9427	9555	9427
q19	1775	1778	1754	1754
q20	2408	2084	2059	2059
q21	8871	8813	8750	8750
q22	900	828	832	828
Total cold run time: 82711 ms
Total hot run time: 79811 ms

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.82 seconds
stream load tsv: 573 seconds loaded 74807831229 Bytes, about 124 MB/s
stream load json: 19 seconds loaded 2358488459 Bytes, about 118 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.6 seconds inserted 10000000 Rows, about 349K ops/s
storage size: 17097250889 Bytes

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.25 seconds
stream load tsv: 558 seconds loaded 74807831229 Bytes, about 127 MB/s
stream load json: 18 seconds loaded 2358488459 Bytes, about 124 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 34 seconds loaded 861443392 Bytes, about 24 MB/s
insert into select: 28.5 seconds inserted 10000000 Rows, about 350K ops/s
storage size: 17096432852 Bytes

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.73% (8420/22922)
Line Coverage: 29.27% (68409/233734)
Region Coverage: 27.87% (35358/126878)
Branch Coverage: 24.65% (18068/73294)
Coverage Report: http://coverage.selectdb-in.cc/coverage/3fa6724be04ec984f286f0c79011ec0009d851dd_3fa6724be04ec984f286f0c79011ec0009d851dd/report/index.html

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.72% (8418/22922)
Line Coverage: 29.26% (68386/233734)
Region Coverage: 27.86% (35345/126878)
Branch Coverage: 24.64% (18061/73294)
Coverage Report: http://coverage.selectdb-in.cc/coverage/9890da2582e016270147d82fb0699fe1c31454f0_9890da2582e016270147d82fb0699fe1c31454f0/report/index.html

@kaijchen
Copy link
Contributor Author

run buildall

1 similar comment
@kaijchen
Copy link
Contributor Author

run buildall

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

Comment on lines +62 to +58
namespace doris {

namespace vectorized {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: nested namespaces can be concatenated [modernize-concat-nested-namespaces]

Suggested change
namespace doris {
namespace vectorized {
namespace doris::vectorized {

be/src/vec/sink/writer/vtablet_writer_v2.cpp:549:

- } // namespace vectorized
- } // namespace doris
+ } // namespace doris

@kaijchen
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.78% (8417/22883)
Line Coverage: 29.27% (68391/233623)
Region Coverage: 27.88% (35359/126824)
Branch Coverage: 24.65% (18065/73298)
Coverage Report: http://coverage.selectdb-in.cc/coverage/c5b855b114e50379617dea3c470f7325013d5c6f_c5b855b114e50379617dea3c470f7325013d5c6f/report/index.html

@kaijchen
Copy link
Contributor Author

run buildall

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

return std::make_shared<OlapTableSinkV2Operator>(this, _sink);
}

Status OlapTableSinkV2LocalState::init(RuntimeState* state, LocalSinkStateInfo& info) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'init' can be made static [readability-convert-member-functions-to-static]

be/src/pipeline/exec/olap_table_sink_v2_operator.h:54:

-     Status init(RuntimeState* state, LocalSinkStateInfo& info) override;
+     static Status init(RuntimeState* state, LocalSinkStateInfo& info) override;

return Status::OK();
}

Status OlapTableSinkV2LocalState::close(RuntimeState* state, Status exec_status) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'close' can be made static [readability-convert-member-functions-to-static]

be/src/pipeline/exec/olap_table_sink_v2_operator.h:61:

-     Status close(RuntimeState* state, Status exec_status) override;
+     static Status close(RuntimeState* state, Status exec_status) override;

OlapTableSinkV2LocalState(DataSinkOperatorXBase* parent, RuntimeState* state)
: Base(parent, state) {};
Status init(RuntimeState* state, LocalSinkStateInfo& info) override;
Status open(RuntimeState* state) override {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'open' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status open(RuntimeState* state) override {
static Status open(RuntimeState* state) override {

_group_commit(group_commit),
_pool(pool) {};

Status init(const TDataSink& thrift_sink) override {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'init' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status init(const TDataSink& thrift_sink) override {
static Status init(const TDataSink& thrift_sink) override {

return Status::OK();
}

Status prepare(RuntimeState* state) override {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'prepare' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status prepare(RuntimeState* state) override {
static Status prepare(RuntimeState* state) override {

return vectorized::VExpr::prepare(_output_vexpr_ctxs, state, _row_desc);
}

Status open(RuntimeState* state) override {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'open' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status open(RuntimeState* state) override {
static Status open(RuntimeState* state) override {

return vectorized::VExpr::open(_output_vexpr_ctxs, state);
}

Status sink(RuntimeState* state, vectorized::Block* in_block,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method 'sink' can be made static [readability-convert-member-functions-to-static]

Suggested change
Status sink(RuntimeState* state, vectorized::Block* in_block,
static Status sink(RuntimeState* state, vectorized::Block* in_block,

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.04 seconds
stream load tsv: 555 seconds loaded 74807831229 Bytes, about 128 MB/s
stream load json: 19 seconds loaded 2358488459 Bytes, about 118 MB/s
stream load orc: 64 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.8 seconds inserted 10000000 Rows, about 347K ops/s
storage size: 17095888638 Bytes

@doris-robot
Copy link

TPC-H test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
Tpch sf100 test result on commit c5b855b114e50379617dea3c470f7325013d5c6f, data reload: false

run tpch-sf100 query with default conf and session variables
q1	5265	5090	5070	5070
q2	352	169	160	160
q3	2041	1880	1918	1880
q4	1365	1247	1252	1247
q5	3928	3901	3983	3901
q6	245	124	127	124
q7	1404	873	899	873
q8	2722	2780	2748	2748
q9	12324	9722	9548	9548
q10	3435	3530	3500	3500
q11	383	245	256	245
q12	444	284	279	279
q13	4575	3824	3765	3765
q14	325	282	287	282
q15	660	568	558	558
q16	668	595	584	584
q17	1125	968	947	947
q18	7825	7489	7437	7437
q19	1666	1672	1673	1672
q20	566	313	290	290
q21	4444	3992	3980	3980
q22	516	415	402	402
Total cold run time: 56278 ms
Total hot run time: 49492 ms

run tpch-sf100 query with default conf and set session variable runtime_filter_mode=off
q1	5043	5009	5033	5009
q2	327	250	235	235
q3	3999	3972	3949	3949
q4	2780	2757	2763	2757
q5	9521	9584	9638	9584
q6	244	125	128	125
q7	2662	2272	2248	2248
q8	4825	4859	4810	4810
q9	13301	13077	13099	13077
q10	4043	4177	4197	4177
q11	767	663	640	640
q12	984	871	846	846
q13	4260	3562	3582	3562
q14	382	362	365	362
q15	625	540	554	540
q16	771	703	706	703
q17	3914	3871	3937	3871
q18	9500	8988	8976	8976
q19	1863	1762	1769	1762
q20	2393	2112	2040	2040
q21	8635	8540	8498	8498
q22	921	840	879	840
Total cold run time: 81760 ms
Total hot run time: 78611 ms

@kaijchen
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.16 seconds
stream load tsv: 563 seconds loaded 74807831229 Bytes, about 126 MB/s
stream load json: 18 seconds loaded 2358488459 Bytes, about 124 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.7 seconds inserted 10000000 Rows, about 348K ops/s
storage size: 17098790590 Bytes

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.76% (8410/22881)
Line Coverage: 29.27% (68380/233650)
Region Coverage: 27.88% (35350/126801)
Branch Coverage: 24.66% (18072/73290)
Coverage Report: http://coverage.selectdb-in.cc/coverage/c9dfc1a63cafb1c94f3fd9011e9d65b4259a2b64_c9dfc1a63cafb1c94f3fd9011e9d65b4259a2b64/report/index.html

@kaijchen
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.76% (8412/22881)
Line Coverage: 29.28% (68390/233551)
Region Coverage: 27.89% (35350/126769)
Branch Coverage: 24.66% (18072/73286)
Coverage Report: http://coverage.selectdb-in.cc/coverage/c9dfc1a63cafb1c94f3fd9011e9d65b4259a2b64_c9dfc1a63cafb1c94f3fd9011e9d65b4259a2b64/report/index.html

@doris-robot
Copy link

TPC-H test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
Tpch sf100 test result on commit c9dfc1a63cafb1c94f3fd9011e9d65b4259a2b64, data reload: false

run tpch-sf100 query with default conf and session variables
q1	4884	4650	4610	4610
q2	364	150	159	150
q3	2046	1979	1961	1961
q4	1392	1246	1253	1246
q5	3956	3948	3984	3948
q6	252	132	132	132
q7	1396	886	881	881
q8	2755	2776	2781	2776
q9	9860	9653	9421	9421
q10	3429	3493	3530	3493
q11	371	252	255	252
q12	439	296	291	291
q13	4557	3814	3851	3814
q14	332	286	296	286
q15	595	550	524	524
q16	672	583	575	575
q17	1122	950	901	901
q18	7812	7401	7322	7322
q19	1651	1661	1653	1653
q20	530	316	298	298
q21	4419	3932	3980	3932
q22	475	379	385	379
Total cold run time: 53309 ms
Total hot run time: 48845 ms

run tpch-sf100 query with default conf and set session variable runtime_filter_mode=off
q1	4557	4537	4553	4537
q2	337	275	288	275
q3	3980	3967	3970	3967
q4	2681	2676	2683	2676
q5	9497	9490	9539	9490
q6	244	123	127	123
q7	2641	2279	2292	2279
q8	4501	4548	4528	4528
q9	13086	13055	13098	13055
q10	4088	4177	4193	4177
q11	766	649	700	649
q12	977	824	818	818
q13	4258	3592	3552	3552
q14	368	340	341	340
q15	580	541	517	517
q16	739	673	683	673
q17	3922	3850	3850	3850
q18	9379	8947	8786	8786
q19	1831	1777	1758	1758
q20	2401	2065	2044	2044
q21	8704	8415	8552	8415
q22	859	836	805	805
Total cold run time: 80396 ms
Total hot run time: 77314 ms

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 44.72 seconds
stream load tsv: 568 seconds loaded 74807831229 Bytes, about 125 MB/s
stream load json: 18 seconds loaded 2358488459 Bytes, about 124 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.2 seconds inserted 10000000 Rows, about 354K ops/s
storage size: 17099454139 Bytes

@kaijchen
Copy link
Contributor Author

run buildall

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

Comment on lines +56 to +58
namespace doris {

namespace vectorized {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: nested namespaces can be concatenated [modernize-concat-nested-namespaces]

Suggested change
namespace doris {
namespace vectorized {
namespace doris::vectorized {

be/src/vec/sink/writer/vtablet_writer_v2.cpp:539:

- } // namespace vectorized
- } // namespace doris
+ } // namespace doris

@doris-robot
Copy link

TPC-H test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
Tpch sf100 test result on commit f1aece4aed5b56ea55b65e0dc9f8eee7c1ecf60b, data reload: false

run tpch-sf100 query with default conf and session variables
q1	4951	4701	4671	4671
q2	366	148	159	148
q3	2034	1890	1902	1890
q4	1392	1252	1265	1252
q5	3956	3983	4023	3983
q6	251	131	133	131
q7	1396	874	891	874
q8	2757	2782	2766	2766
q9	9731	9682	9565	9565
q10	3445	3578	3509	3509
q11	385	250	247	247
q12	448	297	294	294
q13	4606	3820	3776	3776
q14	313	285	284	284
q15	592	539	530	530
q16	660	592	571	571
q17	1136	974	951	951
q18	7712	7349	7399	7349
q19	1668	1677	1676	1676
q20	574	314	310	310
q21	4429	4010	3968	3968
q22	482	382	390	382
Total cold run time: 53284 ms
Total hot run time: 49127 ms

run tpch-sf100 query with default conf and set session variable runtime_filter_mode=off
q1	4541	4589	4614	4589
q2	339	243	265	243
q3	4003	3996	3982	3982
q4	2720	2697	2698	2697
q5	9649	9697	9686	9686
q6	242	124	126	124
q7	2618	2292	2310	2292
q8	4446	4466	4450	4450
q9	13232	13229	13162	13162
q10	4063	4177	4183	4177
q11	777	656	660	656
q12	978	817	808	808
q13	4295	3599	3560	3560
q14	377	347	349	347
q15	583	532	540	532
q16	748	677	678	677
q17	3901	3901	3891	3891
q18	9514	8901	8840	8840
q19	1783	1781	1790	1781
q20	2421	2075	2041	2041
q21	8925	8608	8463	8463
q22	891	831	825	825
Total cold run time: 81046 ms
Total hot run time: 77823 ms

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.52 seconds
stream load tsv: 568 seconds loaded 74807831229 Bytes, about 125 MB/s
stream load json: 19 seconds loaded 2358488459 Bytes, about 118 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 29.1 seconds inserted 10000000 Rows, about 343K ops/s
storage size: 17099524493 Bytes

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.76% (8411/22881)
Line Coverage: 29.27% (68369/233552)
Region Coverage: 27.87% (35335/126769)
Branch Coverage: 24.65% (18063/73286)
Coverage Report: http://coverage.selectdb-in.cc/coverage/f1aece4aed5b56ea55b65e0dc9f8eee7c1ecf60b_f1aece4aed5b56ea55b65e0dc9f8eee7c1ecf60b/report/index.html

@kaijchen
Copy link
Contributor Author

run buildall

@kaijchen
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.76% (8411/22881)
Line Coverage: 29.28% (68368/233523)
Region Coverage: 27.88% (35341/126771)
Branch Coverage: 24.65% (18063/73290)
Coverage Report: http://coverage.selectdb-in.cc/coverage/140078bbe655da111dd5df89e9242883e3d3817d_140078bbe655da111dd5df89e9242883e3d3817d/report/index.html

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

@@ -1005,4 +1010,23 @@ Status PipelineXFragmentContext::send_report(bool done) {
std::placeholders::_2)},
shared_from_this());
}

bool PipelineXFragmentContext::_has_inverted_index_or_partial_update(TOlapTableSink sink) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: method '_has_inverted_index_or_partial_update' can be made static [readability-convert-member-functions-to-static]

be/src/pipeline/pipeline_x/pipeline_x_fragment_context.h:148:

-     bool _has_inverted_index_or_partial_update(TOlapTableSink sink);
+     static bool _has_inverted_index_or_partial_update(TOlapTableSink sink);

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.77% (8413/22882)
Line Coverage: 29.29% (68393/233541)
Region Coverage: 27.89% (35355/126785)
Branch Coverage: 24.65% (18069/73304)
Coverage Report: http://coverage.selectdb-in.cc/coverage/401ae70cfd9a4c325d4b18c7211b596b911786db_401ae70cfd9a4c325d4b18c7211b596b911786db/report/index.html

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.75 seconds
stream load tsv: 581 seconds loaded 74807831229 Bytes, about 122 MB/s
stream load json: 18 seconds loaded 2358488459 Bytes, about 124 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 33 seconds loaded 861443392 Bytes, about 24 MB/s
insert into select: 28.0 seconds inserted 10000000 Rows, about 357K ops/s
storage size: 17100757683 Bytes

@doris-robot
Copy link

TPC-H test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
Tpch sf100 test result on commit 401ae70cfd9a4c325d4b18c7211b596b911786db, data reload: false

run tpch-sf100 query with default conf and session variables
q1	4930	4688	4687	4687
q2	357	162	159	159
q3	2044	1947	1920	1920
q4	1394	1275	1258	1258
q5	3923	3928	3997	3928
q6	248	132	130	130
q7	1388	873	878	873
q8	2766	2791	2778	2778
q9	9671	9554	9484	9484
q10	3467	3532	3513	3513
q11	394	238	251	238
q12	442	293	299	293
q13	4576	3858	3790	3790
q14	324	277	282	277
q15	591	524	521	521
q16	673	587	581	581
q17	1137	955	927	927
q18	7770	7355	7421	7355
q19	1685	1681	1693	1681
q20	555	339	296	296
q21	4332	4037	3950	3950
q22	479	371	376	371
Total cold run time: 53146 ms
Total hot run time: 49010 ms

run tpch-sf100 query with default conf and set session variable runtime_filter_mode=off
q1	4625	4587	4593	4587
q2	333	215	243	215
q3	4001	3995	3989	3989
q4	2706	2682	2699	2682
q5	9606	9573	9580	9573
q6	246	119	123	119
q7	2599	2288	2274	2274
q8	4439	4425	4444	4425
q9	13164	13050	13176	13050
q10	4079	4167	4183	4167
q11	804	653	657	653
q12	976	813	828	813
q13	4262	3524	3561	3524
q14	381	338	349	338
q15	585	526	520	520
q16	731	669	665	665
q17	3884	3925	3820	3820
q18	9523	8961	9119	8961
q19	1819	1800	1766	1766
q20	2396	2052	2047	2047
q21	8748	8738	8802	8738
q22	940	835	757	757
Total cold run time: 80847 ms
Total hot run time: 77683 ms

@doris-robot
Copy link

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.11 seconds
stream load tsv: 578 seconds loaded 74807831229 Bytes, about 123 MB/s
stream load json: 18 seconds loaded 2358488459 Bytes, about 124 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 28.7 seconds inserted 10000000 Rows, about 348K ops/s
storage size: 17101251385 Bytes

@kaijchen kaijchen closed this Nov 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants