Add Bow histogram computation #26
Conversation
… can train a vocabulary of words. Based on SIFT descriptors
Hey @niosus, would you have time to review this one?
A solid PR, @ovysotska! I left a couple of nitpicks here and there, but feel free to ignore them. I think the code is easy to read and the intent is clear. An added plus for having a couple of tests.
return trainImageFiles


def rescaleImageIfNeeded(image):
nit: very minor, but you seem to have docs for the other functions and not for these. Maybe add docstrings to these too?
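Just to illustrate the nit, here is a rough sketch of what a docstring on `rescaleImageIfNeeded` could look like. The size threshold, the naive stride-based subsampling, and the `maxSide` parameter are illustrative assumptions, not taken from the PR:

```python
import numpy as np


def rescaleImageIfNeeded(image, maxSide=1000):
    """Downscale the image so that its longest side is at most maxSide pixels.

    Returns the input unchanged if it is already small enough. Uses naive
    integer-stride subsampling for illustration; maxSide is an assumed default.
    """
    longest = max(image.shape[0], image.shape[1])
    if longest <= maxSide:
        return image
    step = -(-longest // maxSide)  # ceiling division
    return image[::step, ::step]
```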
src/python/bow/bow.py
Outdated
def computeIDF(descriptorsPerImage, clusters):
    """Compute inverse document frequency (IDF). Here means in how many images does the word occur.
Here means in how many images does the word occur.
nit: This reads a bit strange. What did you mean to say here?
I meant that the IDF (inverse document frequency) here stands for the number of images (not documents) in which the word occurs.
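For what it's worth, that interpretation can be sketched roughly like this. The function and variable names are illustrative, not the PR's actual implementation, and `assignWord` is an assumed helper mapping a descriptor to its nearest cluster index:

```python
import numpy as np


def computeIDF(descriptorsPerImage, clusters, assignWord):
    """Compute inverse document frequency, where a 'document' is an image.

    For each visual word, count in how many images it occurs at least once,
    then take idf = log(numImages / numImagesContainingWord).
    """
    numImages = len(descriptorsPerImage)
    imageCounts = np.zeros(len(clusters))
    for descriptors in descriptorsPerImage:
        # Deduplicate per image: a word counts once per image it appears in.
        wordsInImage = {assignWord(d, clusters) for d in descriptors}
        for word in wordsInImage:
            imageCounts[word] += 1
    # Guard against division by zero for words that never occur.
    return np.log(numImages / np.maximum(imageCounts, 1))
```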
src/python/bow/bow.py
Outdated
descriptors = [
    descriptor for descriptors in descriptorsPerImage for descriptor in descriptors
]
descriptors = np.array(descriptors)
I don't really know how to improve this, but the naming seems to be overloaded here. The word descriptors is very overused. Maybe use more descriptive names?
It is the best I could come up with. I also don't know now what that is supposed to be :)
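One option, purely as an illustration: let NumPy do the flattening in one step and give the result a distinct name, so `descriptors` is not reused for both nesting levels. The example data below is hypothetical:

```python
import numpy as np

# Hypothetical example data: two "images" with 128-D SIFT-like descriptors.
descriptorsPerImage = [np.zeros((3, 128)), np.ones((2, 128))]

# np.concatenate flattens the per-image arrays in one call, and the
# name 'allDescriptors' distinguishes the result from the per-image lists.
allDescriptors = np.concatenate(descriptorsPerImage, axis=0)
```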
src/python/bow/bow.py
Outdated
idfs = computeIDF(descriptorsPerImage, words)

plt.bar(range(0, len(idfs)), idfs)
plt.savefig("idf_" + str(kDefaultClusterSize) + ".png")
nit: Do you want to always save these images? Or should it be hidden behind some flag?
Nah, not really important. I'll just remove it and reduce the dependencies list.
src/python/bow/bow.py
Outdated
def getVocabulary(imageTrainFolder, vocabularyFile):
    if vocabularyFile is not None:
nit: you can simplify this to:
if vocabularyFile and vocabularyFile.exists():
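A minimal sketch of how that simplification could look in context, assuming `vocabularyFile` is a `pathlib.Path` (or `None`); `loadVocabulary` and `trainVocabulary` are stand-ins for the PR's actual helpers, not its real function names:

```python
from pathlib import Path


def getVocabulary(imageTrainFolder, vocabularyFile, loadVocabulary, trainVocabulary):
    """Load the vocabulary from vocabularyFile if it exists, else train a new one.

    vocabularyFile may be None; a falsy value or a missing file falls
    through to training. Helper functions are hypothetical stand-ins.
    """
    if vocabularyFile and vocabularyFile.exists():
        return loadVocabulary(vocabularyFile)
    return trainVocabulary(imageTrainFolder)
```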
This PR adds the Python code to compute Bag of Words (BoW) histograms.
In testing mode, the code takes a folder with images and stores the np.array of BoW features in the specified file.
The computed features can then be used further in the current codebase: first turn them into proto files, after which the full pipeline for image_sequence_localizer can be used as usual.