Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Determine the limits of Voyant-tools #2

Open
gacou54 opened this issue Aug 31, 2021 · 1 comment
Open

Determine the limits of Voyant-tools #2

gacou54 opened this issue Aug 31, 2021 · 1 comment
Assignees

Comments

@gacou54
Copy link

gacou54 commented Aug 31, 2021

  • Evaluate how many documents can be loaded by the web interface
  • Evaluate how many documents can be handled by Voyant-tools, frontend and backend (considering that the documents could be pre-loaded via the backend)
@gacou54 gacou54 self-assigned this Aug 31, 2021
@gacou54
Copy link
Author

gacou54 commented Aug 31, 2021

Here are some results of some tests:

With an instance with a 5 GB limit

  • 500 documents | Loadable (a few minutes) | 3.0 GB
  • 1000 documents | Loadable (a few minutes) | 4.0 GB
  • 1500 documents | Loadable (a few minutes) | 3.7 GB
  • 2000 documents | Loadable (a few minutes) | 4.2 GB
  • 2500 documents | Loadable (a few minutes) | 3.9 GB (crashed using the contexts tool (which required 5.4 GB)

With an instance with a 7 GB limit

  • 2500 documents | Loadable (a few minutes) | 4.4 GB
  • 3000 documents | Loadable (a few minutes) | 4.6GB (using the context tool 6.4)
  • 3500 documents | Loadable (a few minutes) | 4.6 Go (crash when using the context tool (requested Go unknown))

The web interface no longer becomes usable for 2500-3000 documents, the use of different tools (eg Terms, Trends, Contexts, etc.) can crash the application or are particularly slow. Also, it becomes barely impossible to consult the results on the web interface. The limitations seem to be as much on the frontend side as on the backend side. In either case, there is a memory limit on the client and server-side.

On the frontend side, the problem seems to be the display and storage in memory of potentially several hundred million words. There may be an option to show only part of the results or documents, even if the results were all been calculated in the backend.

On the backend side, the limitation seems to be a memory limitation. We should check if there is a way to use storage rather than memory while performing analyses.

@gacou54 gacou54 changed the title Déterminer les limites de Voyant-tools Determine the limits of Voyant-tools Sep 1, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant