You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Build models to classify the functions of urban areas with data of satellite images({Area_ID}.jpg) and user behavior({Area_ID}.txt) from given geographical areas.
Tables of the functions of urban areas:
CategoryID
Functions of Areas
001
Residential area
002
School
003
Industrial park
004
Railway station
005
Airport
006
Park
007
Shopping area
008
Administrative district
009
Hospital
For more Detailed Task descriptions, please go to 赛题详情
Environmental Requirements
OS & GPU Configurations
Ubuntu 18.04.1 LTS
GTX 1080Ti x 1 + GTX 1060M x 1
Baidu AI Studio (Tesla V100 x 36, For Model I 36 Networks Training)
Model I: Several Netural Network Stacking (DeepLearning)
# Model Descriptions:
8 Nets
5 folds Stacking
Trained Networks = 7*5+1 (We only trained fold1 on Net6 because of Its Low Local Acc).
# Result:
After 36 NN Stacking, we reached a top online acc of 77.08%.
# Steps:
1) Txt Identical Check (Completed)
2) Multivoters based on Total Times A user Appeared in Same Category (Completed)
3) Multivoters based on Total Hours A user Appeared in Same Category (Only 3/2000 json files were processed)
# Result:
After above 3 steps, we got a submission of 81.62%. (81.6200%.txt)
# Notes:
[1] Step 3 was not completed Because of Limited Time and Computation Resources, only 3/2000 data was processed.
[2] In this project we abbreviate {Preliminary,Semi-Final}-{Train,Test}-Datasets as {P,S}{Tr,Te} ==> {PTr,PTe,STr,STe}.
Steps
Content Descriptions
Oringinal Score
After Improved
Source Code
(1)
Utilize Identical txts' Categories in PTr & STr to provide answers for STe
Model III: Merge & Rebalance the Predicts in Submissions (Post-processing)
Directly Modify Submission.txt:
We Compared predicts in 81.2440%.txt and 81.6200%.txt, finding that 001 was TOO MANY (4k more than True Value), 003/005 were a bit more-predicted, and others were all less-predicted.
Category Distributions in Our Submissions (Take 81.6200%.txt for Example)
Therefore, We Merged the Predicts among our ex-Top2 Submissions. (81.6200%.txt & 81.2440%.txt)
Strategy & Rules:
1) Compare & Merge the predicts in the two txt file by Replacing those '001's to other less-predicted categories.
2) While the two gives the same prediction or both predictions are in More-Predicted Categories ['001','003','005'], Choose the answer in 81.6200%.txt as result Beacause of its Higher Acc.
After this operation, we got our final best submission 82.1800%.txt, which reached 82.18%.