A growing number of applications must process massive data streams in realtime. Computing platforms like Storm ease the development of realtime analytics applications, but they also increase the need for efficient algorithms that can run in a single pass over the data stream. In this talk, I'll give a brief overview of some interesting probabilistic data structures that can be used in this context: Bloom filter, Temporal Bloom filter, Count-Min Sketch and HyperLogLog.
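
To make the single-pass idea concrete, here is a minimal sketch of the first structure in that list, a Bloom filter. It is not taken from the talk material: the class name `BloomFilter`, the parameters `num_bits` and `num_hashes`, and the choice of salted SHA-256 as the hash family are all assumptions made for illustration.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: set membership with possible false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # Both sizes are arbitrary illustrative defaults, not tuned values.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the sketch readable; a real one would pack bits.
        self.bits = bytearray(num_bits)

    def _positions(self, item):
        # Derive num_hashes positions by salting SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        # Each stream element is seen once and only sets a few bits.
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item):
        # True may be a false positive; False is always correct.
        return all(self.bits[pos] for pos in self._positions(item))
```

An item that was added is always reported as present, while an item that was never added is usually, but not always, reported as absent. Memory stays fixed at `num_bits` regardless of how many elements flow through the stream.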
mlaprise/pydata2013-pds-talk
Probabilistic Data Structures for Realtime Analytics (PyData 2013)