Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • ketos ketos
  • Project information
    • Project information
    • Activity
    • Labels
    • Planning hierarchy
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 27
    • Issues 27
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • public_projects
  • ketosketos
  • Merge requests
  • !247

Merged
Created May 27, 2022 by Bruno Padovese@bpadoveseOwner

Optimization create_rndm_selections

  • Overview 6
  • Commits 7
  • Pipelines 6
  • Changes 3

Previous versions of the method ketos.data_handling.selection_table.create_rndm_selections were painfully slow when generating large amounts of samples (10000+) to the points that it was a major bottleneck when creating a new dataset with the method.

I made several changes to method both improving readability (it should be easier to understand and follow the code) as well as to the efficiency. It is now significantly faster when trying to generate a large amount of selections.

In my tests, when trying to generate 200000+ selections it now takes 17 s as opposed to the 331 s it took before. So by about a factor of 20 in this case. With fewer selections, there is still an improvement but it is not as significant.

From my tests, the code is working exactly the same.

Assignee
Assign to
Reviewer
Request review from
Time tracking
Source branch: optimization_rdnm_selections