Configurable parameters to improve performance while reading page blobs #378

majumd · 2021-01-06T18:28:00Z

Hi,
I would like to know the way to improve performance while reading from a page blob. Are there any configurable parameters such as control the number of threads or buffer size which could be used to improve performance?
An enhancement to have the performance factors configurable to tweak as per the environment would be helpful.
Thanks
Udayan

Jinming-Hu · 2021-01-08T02:59:38Z

Hi @majumd , every blob API accepts a blob_request_options as a parameter. blob_request_options has a member function set_parallelism_factor with which you can set the max number of threads performing the download operation.

Jinming-Hu · 2021-01-08T03:01:18Z

You also mentioned buffer size, actually there will be multiple data copy during the download process. For example, you download 100MB blob, the 100MB data will be copied 2 or 3 times (I cannot remember). Is this also something you want to optimize?

majumd · 2021-01-09T15:36:40Z

Hi @majumd , every blob API accepts a blob_request_options as a parameter. blob_request_options has a member function set_parallelism_factor with which you can set the max number of threads performing the download operation.

Thanks for the response. I could see that the default value of the member variable is m_parallelism_factor is 1. Could you please explain how this could be used to improve data read performance from Azure Cloud.

Suppose we would like to read 40MB of data, Could the value of the variable be set to 10 using function set_parallelism_factor ?
Does it mean that now the read request of 40MB would ideally take the same time as the time taken for 4MB as 10 parallel requests would be made to Azure each request requesting for 4MB data as per m_stream_read_size?

Jinming-Hu · 2021-01-10T03:08:21Z

Suppose we would like to read 40MB of data, Could the value of the variable be set to 10 using function set_parallelism_factor ?
Does it mean that now the read request of 40MB would ideally take the same time as the time taken for 4MB as 10 parallel requests would be made to Azure each request requesting for 4MB data as per m_stream_read_size?

Yes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configurable parameters to improve performance while reading page blobs #378

Configurable parameters to improve performance while reading page blobs #378

majumd commented Jan 6, 2021

Jinming-Hu commented Jan 8, 2021

Jinming-Hu commented Jan 8, 2021 •

edited

Loading

majumd commented Jan 9, 2021

Jinming-Hu commented Jan 10, 2021

Configurable parameters to improve performance while reading page blobs #378

Configurable parameters to improve performance while reading page blobs #378

Comments

majumd commented Jan 6, 2021

Jinming-Hu commented Jan 8, 2021

Jinming-Hu commented Jan 8, 2021 • edited Loading

majumd commented Jan 9, 2021

Jinming-Hu commented Jan 10, 2021

Jinming-Hu commented Jan 8, 2021 •

edited

Loading