Skip to content

Probabilistic Data Structures for Realtime Analytics (PyData 2013)

Notifications You must be signed in to change notification settings

mlaprise/pydata2013-pds-talk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Probabilistic Data Structures for Realtime Analytics (PyData 2013)

More and more applications are now dealing with massive data that need to be processed in realtime. While easing the development of realtime analytics applications, computing platforms like Storm increases the need for efficient algorithms that can run on a single pass on the data stream. In this talk, I'll give a brief overview of some interesting probabilistic data structures that can used in this context: Bloomfilter, Temporal Bloomfilter, Count-Min Sketch and HyperLogLog.

About

Probabilistic Data Structures for Realtime Analytics (PyData 2013)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages