Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

working chromo-sweep #46

Draft
wants to merge 1 commit into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
20 changes: 20 additions & 0 deletions generated/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
# Directory for generated test files.
Files are generated with org.biodatageeks.rangejoins.generation.TestDataGenerator

## Generation parameters
Data is generated semi-randomly with following parameters:

- amount - amount of intervals to generate
- maxOffset - max interval to start generation
- maxRange - max size of interval
- maxStep - max space between intervals

## Datasets
Parameters for generated datasets
1. Denser Labels (expecting less nulls after join)
- queries [1000, 50, 5, 30]
- labes [1000, 50, 15, 20]

2. Denser Queries (expecting more nulls after join)
- queries [1000, 50, 15, 20]
- labels [1000, 50, 5, 30]
126 changes: 126 additions & 0 deletions generated/denser-labels/results/part-00000
Original file line number Diff line number Diff line change
@@ -0,0 +1,126 @@
11078,11080,q625,11071,11085,l633
11087,11091,q626,0,0,null
11098,11101,q627,0,0,null
11112,11113,q628,0,0,null
11138,11142,q629,11134,11141,l636
11166,11169,q630,0,0,null
11175,11176,q631,0,0,null
11197,11201,q632,11192,11198,l640
11197,11201,q632,11200,11213,l641
11230,11232,q633,0,0,null
11253,11257,q634,11248,11254,l644
11275,11276,q635,11273,11285,l645
11290,11294,q636,11292,11306,l646
11301,11305,q637,11292,11306,l646
11319,11321,q638,0,0,null
11328,11331,q639,11325,11334,l647
11345,11346,q640,0,0,null
11360,11361,q641,11353,11363,l648
11383,11385,q642,11380,11386,l650
11402,11405,q643,11398,11408,l651
11429,11432,q644,11427,11437,l652
11435,11437,q645,11427,11437,l652
11441,11445,q646,0,0,null
11474,11475,q647,11470,11476,l654
11504,11506,q648,0,0,null
11515,11517,q649,0,0,null
11538,11541,q650,11538,11542,l659
11553,11557,q651,11554,11556,l660
11575,11578,q652,11575,11581,l661
11598,11600,q653,11598,11610,l662
11611,11614,q654,0,0,null
11615,11619,q655,0,0,null
11625,11629,q656,11621,11628,l663
11642,11643,q657,0,0,null
11656,11659,q658,11655,11660,l666
11677,11680,q659,11677,11690,l668
11709,11710,q660,11707,11711,l669
11722,11726,q661,11716,11724,l670
11729,11733,q662,0,0,null
11735,11738,q663,11736,11740,l671
11766,11768,q664,11760,11768,l673
11770,11774,q665,0,0,null
11781,11782,q666,11779,11784,l674
11796,11800,q667,11798,11812,l675
11811,11812,q668,11798,11812,l675
11819,11822,q669,0,0,null
11825,11828,q670,0,0,null
11835,11839,q671,11835,11839,l677
11866,11867,q672,0,0,null
11879,11882,q673,11879,11882,l680
11900,11904,q674,0,0,null
11921,11924,q675,0,0,null
11925,11928,q676,0,0,null
11941,11943,q677,11935,11941,l684
11947,11949,q678,0,0,null
11967,11969,q679,11959,11967,l686
11975,11979,q680,0,0,null
11986,11988,q681,11984,11987,l687
12000,12004,q682,12001,12010,l688
12023,12025,q683,12016,12028,l689
12047,12051,q684,12043,12049,l691
12056,12060,q685,0,0,null
12069,12071,q686,12066,12069,l692
12076,12080,q687,0,0,null
12082,12086,q688,0,0,null
12092,12094,q689,12087,12098,l693
12100,12104,q690,12100,12108,l694
12107,12108,q691,12100,12108,l694
12109,12110,q692,0,0,null
12139,12141,q693,0,0,null
12150,12153,q694,12152,12157,l697
12154,12155,q695,12152,12157,l697
12184,12187,q696,12180,12191,l699
12206,12209,q697,12206,12217,l701
12220,12221,q698,0,0,null
12224,12228,q699,12226,12238,l702
12246,12248,q700,12240,12248,l703
12271,12273,q701,0,0,null
12292,12293,q702,0,0,null
12300,12302,q703,12302,12313,l706
12329,12331,q704,0,0,null
12334,12336,q705,12335,12346,l708
12337,12339,q706,12335,12346,l708
12348,12351,q707,0,0,null
12370,12373,q708,12364,12377,l710
12375,12376,q709,12364,12377,l710
12391,12394,q710,0,0,null
12407,12411,q711,12403,12415,l712
12432,12433,q712,0,0,null
12462,12464,q713,12455,12467,l715
12491,12494,q714,12494,12502,l717
12500,12503,q715,12494,12502,l717
12527,12530,q716,12521,12530,l718
12550,12553,q717,12538,12550,l719
12579,12583,q718,12566,12580,l720
12612,12614,q719,0,0,null
12616,12619,q720,12615,12627,l722
12647,12648,q721,12643,12654,l723
12656,12659,q722,0,0,null
12674,12675,q723,12666,12677,l724
12684,12686,q724,12680,12685,l725
12691,12694,q725,0,0,null
12718,12721,q726,12716,12728,l727
12735,12738,q727,0,0,null
12745,12746,q728,12745,12748,l728
12762,12763,q729,0,0,null
12765,12767,q730,12765,12779,l729
12786,12787,q731,12787,12788,l730
12800,12803,q732,0,0,null
12812,12814,q733,12804,12814,l732
12827,12828,q734,12823,12829,l733
12856,12857,q735,12855,12866,l735
12883,12885,q736,12877,12884,l736
12905,12908,q737,0,0,null
12921,12925,q738,12923,12934,l738
12939,12940,q739,12937,12940,l739
12944,12946,q740,0,0,null
12959,12963,q741,12962,12968,l741
12976,12977,q742,0,0,null
12981,12984,q743,0,0,null
12996,13000,q744,12987,12999,l742
13020,13024,q745,13009,13020,l743
13042,13043,q746,13035,13044,l744
13072,13075,q747,13059,13073,l745
13103,13106,q748,13103,13115,l747
13135,13136,q749,0,0,null
126 changes: 126 additions & 0 deletions generated/denser-labels/results/part-00001
Original file line number Diff line number Diff line change
@@ -0,0 +1,126 @@
13137,13139,q750,0,0,null
13164,13168,q751,13162,13170,l750
13179,13182,q752,13179,13184,l751
13195,13198,q753,13198,13203,l752
13214,13217,q754,0,0,null
13241,13242,q755,0,0,null
13253,13255,q756,13251,13256,l754
13275,13277,q757,0,0,null
13286,13287,q758,13279,13288,l756
13291,13293,q759,0,0,null
13318,13319,q760,13314,13319,l759
13328,13331,q761,13328,13336,l760
13334,13337,q762,13328,13336,l760
13363,13367,q763,13356,13363,l762
13393,13397,q764,0,0,null
13413,13416,q765,0,0,null
13436,13438,q766,13428,13439,l765
13446,13449,q767,13443,13452,l766
13468,13471,q768,0,0,null
13498,13499,q769,13496,13502,l770
13514,13516,q770,0,0,null
13540,13543,q771,13534,13546,l772
13566,13569,q772,13565,13570,l773
13587,13589,q773,13589,13601,l774
13618,13622,q774,13606,13619,l775
13623,13624,q775,0,0,null
13651,13652,q776,13650,13656,l777
13678,13680,q777,13670,13680,l779
13704,13705,q778,0,0,null
13716,13718,q779,13708,13717,l781
13723,13725,q780,13722,13724,l782
13745,13746,q781,13745,13746,l784
13763,13766,q782,13754,13767,l785
13787,13788,q783,13785,13795,l787
13793,13795,q784,13785,13795,l787
13811,13815,q785,0,0,null
13832,13834,q786,13827,13839,l789
13855,13858,q787,0,0,null
13880,13884,q788,13878,13881,l792
13912,13916,q789,0,0,null
13943,13947,q790,13937,13951,l795
13956,13959,q791,0,0,null
13969,13971,q792,13965,13979,l796
13973,13974,q793,13965,13979,l796
13984,13988,q794,13982,13995,l797
14009,14011,q795,0,0,null
14034,14036,q796,0,0,null
14064,14065,q797,0,0,null
14069,14070,q798,0,0,null
14097,14101,q799,0,0,null
14109,14111,q800,14108,14109,l802
14118,14121,q801,0,0,null
14133,14135,q802,14126,14137,l803
14150,14151,q803,14142,14156,l804
14180,14184,q804,0,0,null
14185,14186,q805,0,0,null
14187,14190,q806,14188,14202,l806
14197,14199,q807,14188,14202,l806
14211,14212,q808,14203,14216,l807
14231,14235,q809,14233,14237,l809
14240,14242,q810,0,0,null
14256,14257,q811,0,0,null
14271,14275,q812,14267,14277,l811
14299,14302,q813,14302,14307,l813
14313,14314,q814,0,0,null
14336,14337,q815,14334,14342,l815
14364,14366,q816,14356,14369,l816
14385,14386,q817,0,0,null
14391,14393,q818,14393,14404,l818
14395,14399,q819,14393,14404,l818
14409,14412,q820,0,0,null
14430,14432,q821,0,0,null
14441,14445,q822,14436,14442,l820
14465,14469,q823,14467,14477,l822
14472,14476,q824,14467,14477,l822
14486,14489,q825,0,0,null
14518,14522,q826,14511,14522,l824
14524,14526,q827,0,0,null
14529,14533,q828,0,0,null
14559,14562,q829,14562,14568,l827
14570,14571,q830,0,0,null
14573,14575,q831,14574,14576,l828
14585,14588,q832,0,0,null
14617,14618,q833,14616,14619,l830
14647,14651,q834,14643,14652,l832
14670,14673,q835,14669,14677,l833
14689,14691,q836,14690,14692,l834
14698,14702,q837,0,0,null
14726,14729,q838,14729,14733,l836
14744,14745,q839,14738,14747,l837
14767,14770,q840,14765,14767,l839
14767,14770,q840,14768,14778,l840
14797,14800,q841,14796,14809,l841
14811,14812,q842,0,0,null
14825,14828,q843,14825,14839,l843
14839,14840,q844,14825,14839,l843
14853,14855,q845,14850,14857,l844
14884,14885,q846,14879,14891,l846
14912,14914,q847,14912,14917,l848
14925,14928,q848,14928,14933,l849
14954,14957,q849,14950,14956,l850
14964,14968,q850,14965,14968,l851
14994,14996,q851,14982,14994,l852
15013,15017,q852,0,0,null
15043,15046,q853,15041,15055,l855
15067,15068,q854,0,0,null
15086,15089,q855,15087,15097,l857
15104,15108,q856,0,0,null
15129,15130,q857,0,0,null
15145,15148,q858,15136,15150,l859
15170,15173,q859,0,0,null
15193,15194,q860,0,0,null
15221,15224,q861,0,0,null
15225,15229,q862,15226,15238,l863
15246,15247,q863,0,0,null
15261,15265,q864,0,0,null
15269,15272,q865,0,0,null
15300,15302,q866,15294,15306,l866
15322,15323,q867,15322,15323,l867
15326,15329,q868,0,0,null
15354,15358,q869,15351,15354,l870
15373,15374,q870,0,0,null
15397,15398,q871,15390,15401,l872
15403,15406,q872,0,0,null
15422,15423,q873,15416,15426,l873
15440,15442,q874,15442,15451,l874
Loading