
WIP: Implementing EAT_SSL #15

Draft · wants to merge 23 commits into main from EAT_SSL_Implementation
Conversation


@PariaValizadeh commented Oct 9, 2024

This branch includes an implementation of the EAT model, extracted from its repo.

EAT_SSL Info

  • README file
  • Add Requirements

Fine-tuned model checkpoints

  • EAT_large_epoch20 (fine-tuned on AS-2M): ViT-L backbone, 309M parameters, 49.5% mAP
  • EAT_base_epoch30 (fine-tuned on AS-2M): ViT-L backbone, 309M parameters, 48.9% mAP

Model and dependencies

  • Add EAT model from EAT repo (EAT_audio_classification.py)
  • Add EAT model dependencies from EAT repo (other files in the models)
  • Add utils from EAT repo
  • Add eat.py (a wrapper around the original model, containing the input preprocessing)

Embedding model

  • Add embedding model for eat
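As a rough illustration of what the embedding model needs to do once the backbone runs: the transformer returns per-patch features, and the embedding wrapper has to reduce them to a single clip vector. A minimal sketch, assuming mean or CLS pooling; `pool_embeddings` is an illustrative name, not the actual BirdSet/EAT API:

```python
import numpy as np

def pool_embeddings(patch_features: np.ndarray, mode: str = "mean") -> np.ndarray:
    """Reduce (num_patches, dim) transformer outputs to a single (dim,) clip embedding."""
    if mode == "mean":
        # average over all patch tokens
        return patch_features.mean(axis=0)
    if mode == "cls":
        # assume the first token is a CLS token
        return patch_features[0]
    raise ValueError(f"unknown pooling mode: {mode}")

# toy example: 4 patches with 3-dim features
feats = np.arange(12, dtype=np.float32).reshape(4, 3)
clip = pool_embeddings(feats, "mean")
```

Which pooling mode matches the checkpoint (CLS token vs. mean over patches) is one of the settings worth experimenting with.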

Efforts to get fairseq to run

Reimplementing EAT

To reimplement EAT we use the model code from https://github.com/nhaH-luaP/PyEat, as it only relies on a small local copy of fairseq.

  • Test what parts are in the checkpoint
  • Download and add the fairseq and model files
  • Implement correct parameter parsing when creating the model
  • Change the eat.yaml to fit the new implementation
  • Correctly load the checkpoint, implement get_embeddings() and forward()
  • Check different experiments using EAT (Watkins, Bats...)
  • Try out different settings
  • Recreate ESC accuracies from the paper
    • implement cross-validation on ESC-50 (take another look at the EAT-SSL and BEATS papers)
    • check normalization (check paper and original implementation)

    ```python
    parser.add_argument('--norm_mean', type=float, help='mean value for normalization', default=-4.268)
    parser.add_argument('--norm_std', type=float, help='standard deviation for normalization', default=4.569)
    ```

  • Add license, more documentation
  • ...
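The normalization item in the checklist above can be sketched as follows. The argparse defaults (-4.268 / 4.569) appear to be AudioSet statistics; AST-style pipelines divide by 2·std so the result has a standard deviation of about 0.5, matching the value quoted from the paper later in this thread. Whether EAT uses exactly this form should be verified against the original implementation — this is a sketch under that assumption:

```python
import numpy as np

NORM_MEAN = -4.268  # AudioSet mean (argparse default above)
NORM_STD = 4.569    # AudioSet std (argparse default above)

def normalize_fbank(fbank: np.ndarray, mean: float = NORM_MEAN, std: float = NORM_STD) -> np.ndarray:
    """Standardize a (frames, mel_bins) log-mel spectrogram.

    Dividing by 2 * std gives outputs with std ~0.5, the convention
    assumed here (AST-style); check against EAT's own preprocessing.
    """
    return (fbank - mean) / (2 * std)
```

If ESC-50 or Watkins need their own statistics, only the two constants would change.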

@PariaValizadeh PariaValizadeh changed the title WIP: Implement Eat_ssl WIP: Implementing EAT_SSL Oct 9, 2024
@PariaValizadeh PariaValizadeh force-pushed the EAT_SSL_Implementation branch 3 times, most recently from f955b2c to 135d015 Compare October 11, 2024 10:27

@XgamerTV commented Oct 26, 2024

The repo from Lukas' students was extremely helpful and allowed me to implement the model without too much hassle. The accuracy of the AS-FT checkpoint on Watkins is ~63% at the moment, which is not great, so we should experiment with different settings and see whether the AS/ESC accuracies match the ones from the paper!

@raphaelschwinger

@XgamerTV Perfect, thanks! Good idea to test AS/ESC50 performance!

@PariaValizadeh
Author

@XgamerTV I added a new experiment on ESC-50. It has about the same accuracy, around 64%.

@XgamerTV

The dataset mean (0) and std (0.5) values seem to have been the problem. Where did you get those values, @PariaValizadeh? I used the AS values for now and the Watkins accuracy is ~86% now :)
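If the AS values turn out not to transfer to other datasets, the per-dataset statistics could be estimated directly from the spectrograms. A streaming sketch (the function name is hypothetical; `spectrograms` stands in for an iterable of log-mel arrays):

```python
import numpy as np

def estimate_norm_stats(spectrograms):
    """Accumulate mean/std over all spectrogram bins without stacking everything in memory."""
    count, total, total_sq = 0, 0.0, 0.0
    for spec in spectrograms:
        count += spec.size
        total += float(spec.sum())
        total_sq += float(np.square(spec).sum())
    mean = total / count
    # variance via E[x^2] - E[x]^2
    std = np.sqrt(total_sq / count - mean ** 2)
    return mean, std
```

Running this once over the training split would give dataset-specific values to compare against the AudioSet defaults.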

@PariaValizadeh
Author

> The dataset mean (0) and std (0.5) values seemed to have been the problem. Where did you get the values @PariaValizadeh? I used the AS values for now and the watkins accuracy is ~86 now :)

I saw those values in the article. I also had doubts about them, so I changed them to check whether the accuracy would change, but for the values I tried it didn't.

@PariaValizadeh
Author

PariaValizadeh commented Oct 28, 2024

> The dataset mean (0) and std (0.5) values seemed to have been the problem. Where did you get the values @PariaValizadeh? I used the AS values for now and the watkins accuracy is ~86 now :)

> I saw that in the article, I also have a doubt on that so I change it to check if it will change the accuracy or not but for the value I put, It haven't changed

@XgamerTV Under the training details heading, the paper says: 'the audio spectrogram patches are then normalized with a mean value of 0 and a standard deviation of 0.5, following the approach used in previous works.' But I'm not sure whether that applies when we put a linear classifier on top.

@XgamerTV

OK, I found it in the article, but I think the values are weird, and they do heavily change the accuracy. Regarding your change of values and not noticing any difference in accuracy: when using the embedding_datamodule, not every change to the model triggers a new extraction of the embeddings. The usual fingerprint method didn't work for the embedding datasets, which is why we encode the important params in the name instead. If you change a value and no ">> Extracting Embeddings for train Split" line is logged, the old set has to be deleted manually from the data_birdset folder.
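The caching behaviour described above can be illustrated roughly like this: the cache directory name is built from the "important params", so changing a value that is not part of the name silently reuses stale embeddings. The path scheme below is modeled on the log output later in the thread, but the exact BirdSet layout may differ:

```python
import shutil
from pathlib import Path

def embeddings_cache_dir(root: str, dataset: str, model: str,
                         sample_rate: int, length: int) -> Path:
    """Build the cache path from the 'important params' (hypothetical scheme).

    Parameters that are NOT part of this name (e.g. norm mean/std) do not
    invalidate the cache, which is the pitfall described above.
    """
    return Path(root) / dataset / (
        f"{dataset}_processed_embedding_model_{model}_{sample_rate}_{length}"
    )

def clear_stale_cache(cache_dir: Path) -> None:
    """Delete a cached embedding set so it gets re-extracted on the next run."""
    if cache_dir.exists():
        shutil.rmtree(cache_dir)
```

So after changing the normalization values, deleting the matching directory under data_birdset forces a fresh ">> Extracting Embeddings" run.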

@PariaValizadeh
Author

PariaValizadeh commented Oct 28, 2024

> Ok I found it in the article but I think the values are weird and it does heavily change the accuracy. Regarding your change of values and not noticing any differences in the accuracy: When using the embedding_datamodule not all changes to the model result in a new extraction of the embeddings. This is because the usual fingerprint method didn't work for the embedding datasets which is why we used the important params in the name. If you change a value and see that no ">> Extracting Embeddings for train Split" is logged the old set has to be deleted manually in the data_birdset folder.

Thank you, I will check it tomorrow and also check whether it works with the AS values.

@XgamerTV

You can check the AS values if you want, but I don't think it's necessary. The ESC-50 accuracy is 98.5%, which is better than theirs in the paper, and they do fine-tuning 🤣 On ESC the embedding datamodule is a bit weird; I'll check it next time. Can you also get the accuracies?

@PariaValizadeh
Author

> Ok I found it in the article but I think the values are weird and it does heavily change the accuracy. Regarding your change of values and not noticing any differences in the accuracy: When using the embedding_datamodule not all changes to the model result in a new extraction of the embeddings. This is because the usual fingerprint method didn't work for the embedding datasets which is why we used the important params in the name. If you change a value and see that no ">> Extracting Embeddings for train Split" is logged the old set has to be deleted manually in the data_birdset folder.

I deleted it manually and changed the mean and std, which shows no difference in the results.

@XgamerTV

Yeah, there is something wrong with ESC; if you try it with BEANS you should see the difference 🤔

@PariaValizadeh
Author

> …try it with BEANs you should see the difference

Yes, I saw that. I also asked some other questions in Mattermost: something is not completely clear to me about our discussion here. First: with how many epochs did you get 98.5% accuracy on ESC-50? I got 64% after 4 epochs. Second: by getting the accuracies, do you mean I should find out why it doesn't reproduce the paper results, or just check the accuracy on different datasets?

@PariaValizadeh
Author

@XgamerTV I also tried getting the accuracy for BEATs on ESC-50, which ran into errors: first it asked for the embedding size, and when I set it, there was an error about a mismatch between the classifier and the embedding matrix.

@XgamerTV

I couldn't recreate the accuracies myself and I believe it was a weird caching error.

@PariaValizadeh
Author

PariaValizadeh commented Nov 12, 2024

@XgamerTV I saw d04b852, but when I set that to False it still loads the data from cache until I delete it manually. And during loading it does not split the data into three parts, which is strange. I added the splitting part myself and it works! Maybe I am not on the updated version, or you have something else in your code, but for me the loaded dataset only contains the test and train splits without updating this in embedding_datamodule.py:
```python
def prepare_data(self):
    """Same as prepare_data in BaseDataModuleHF but checks if path exists and skips the rest otherwise."""
    log.info("Check if preparing has already been done.")
    if self._prepare_done:
        log.info("Skip preparing.")
        return
    # Check if the embeddings for the dataset have already been computed
    if os.path.exists(self.embeddings_save_path):
        log.info(f"Embeddings found in {self.embeddings_save_path}, loading from disk")
        dataset = load_from_disk(self.embeddings_save_path)
    else:
        log.info("Prepare Data")
        dataset = self._load_data()
        ### dataset = self._create_splits(dataset)
        ### log.info("print Data")
        dataset = self._compute_embeddings(dataset)
    dataset = self._preprocess_data(dataset)
    # set the length of the training set to be accessed by the model
    self.len_trainset = len(dataset["train"])
    self._save_dataset_to_disk(dataset)

    # set to done so that lightning does not call it again
    self._prepare_done = True
```

This is also the log that shows it has only two splits:

```
Repo card metadata block was not found. Setting CardData to empty.
[2024-11-12 11:48:41,584][huggingface_hub.repocard][WARNING] - Repo card metadata block was not found. Setting CardData to empty.
[2024-11-12 11:48:43,903][birdset.datamodule.embedding_datamodule][INFO] - >> Extracting Embeddings for train Split
[2024-11-12 11:48:43,906][birdset.datamodule.embedding_datamodule][INFO] - >> Extracting Embeddings for test Split
[2024-11-12 11:48:43,908][birdset.datamodule.embedding_datamodule][INFO] - Saving emebeddings to disk: /workspace/data_birdset/esc50/esc50_processed_embedding_model_audio_mae_True_16000_10
Saving the dataset (1/1 shards): 100%| 1600/1600 [00:00<00:00, 106791.53 examples/s]
Saving the dataset (1/1 shards): 100%| 400/400 [00:00<00:00, 75973.45 examples/s]
```
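The missing split step described above could be sketched like this. With HuggingFace datasets one would call `dataset["train"].train_test_split(...)` instead; the plain-list version below (hypothetical function, not the actual `_create_splits`) only illustrates the logic of carving a validation set out of "train":

```python
import random

def create_splits(dataset: dict, valid_fraction: float = 0.2, seed: int = 42) -> dict:
    """Turn a {train, test} dataset into {train, valid, test} by splitting off part of train."""
    if "valid" in dataset:
        return dataset  # already has three splits
    train = list(dataset["train"])
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(train)
    n_valid = int(len(train) * valid_fraction)
    return {
        "train": train[n_valid:],
        "valid": train[:n_valid],
        "test": dataset["test"],
    }
```

Whatever the real implementation looks like, the key point is that the split has to happen before `_compute_embeddings`, otherwise the cached embedding set is frozen with only two splits.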
