added vectorized pak version #72

diegodoimo · 2022-07-16T15:48:35Z

I added a ''faster'' version of pak, vectorizing all the for loops and adding a cython version to optimize the likelihood of the full dataset and not just that of a single point. Unfortunately, the performance improvement can be appreciated only for relatively large data sizes:

15% faster for 90k points
50% faster for 250k points

codecov-commenter · 2022-07-16T15:50:14Z

Codecov Report

Merging #72 (17c7f24) into main (f290feb) will decrease coverage by 1.81%.
The diff coverage is 76.19%.

@@            Coverage Diff             @@
##             main      #72      +/-   ##
==========================================
- Coverage   80.61%   78.79%   -1.82%     
==========================================
  Files          10       10              
  Lines        1145     1160      +15     
==========================================
- Hits          923      914       -9     
- Misses        222      246      +24

Impacted Files	Coverage Δ
dadapy/density_estimation.py	`76.29% <75.00%> (-0.40%)`	⬇️
dadapy/_utils/density_estimation.py	`54.83% <76.47%> (-34.96%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f290feb...17c7f24. Read the comment docs.

AldoGl · 2022-07-18T13:20:24Z

Thanks @diegodoimo this is a change that me and @imacocco had in mind for a long time. Apart from the small code improvements suggested I have the following curiosity: for small datasets do we see a near zero improvement or do we actually see worse results using the full Cython implementation?

diegodoimo · 2022-07-18T14:24:38Z

Better in any case. It can be easily tested as I left the flag 'optimized' to select the original implementation, when set to false. (I left it as I didn't want to remove a big piece of the code until we are sure the 'optimized' one can be fine)

diegodoimo added 4 commits July 15, 2022 22:15

time benchmark added

b96e16d

added optimized pak version

ae799ae

small change in the original pak

fd96ec9

lint

81fd69d

diegodoimo requested review from AldoGl and alexdepremia July 16, 2022 15:48

diegodoimo added 2 commits July 16, 2022 22:33

minor changes to cython maxlikelihood full script

d47c9f2

estetics cython

17c7f24

fix small bug

b3e610e

small improvements optimized pak

4188631

diegodoimo merged commit 9cffc40 into main Jul 19, 2022

diegodoimo deleted the optimization_pak branch July 19, 2022 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added vectorized pak version #72

added vectorized pak version #72

diegodoimo commented Jul 16, 2022

codecov-commenter commented Jul 16, 2022 •

edited

Loading

AldoGl commented Jul 18, 2022

diegodoimo commented Jul 18, 2022 •

edited

Loading

added vectorized pak version #72

added vectorized pak version #72

Conversation

diegodoimo commented Jul 16, 2022

codecov-commenter commented Jul 16, 2022 • edited Loading

Codecov Report

AldoGl commented Jul 18, 2022

diegodoimo commented Jul 18, 2022 • edited Loading

codecov-commenter commented Jul 16, 2022 •

edited

Loading

diegodoimo commented Jul 18, 2022 •

edited

Loading