Miscellaneous notes, data and scripts. This is not a cohesive sub-project.
- Import the individual state-level ZIP code data files with this command:
mongoimport --db test --collection zips <(cat data/zips_*)
- Import the monolithic ZIP code data file with this command:
mongoimport --db test --collection zips data/zips.json
- Export a single state's ZIP code data
mongoexport --db test --collection zips --query '{ "state": "NY" }' --out data/zips_NY.json
Copy the common source code to the other sub-projects with ./copy-common-code.sh
.
General clean-ups, TODOs and things I wish to implement for this project:
- DONE Start authoring the common files like
db.js
andutil.js
in themisc/
sub-project and copy them out to the individual sub-projects as needed. Use a script to copy them out to the other projects. I use this strategyhttps://github.com/dgroomes/kafka-playground/tree/main/utility-scripts
effectively. Technically, the sub-projects still adhere to the "standalone" constraint because there is no runtime or build time dependency between those projects and this projects. A reader can read an individual project and in fact delete themisc/
sub-project and the other sub-project will still work. - DONE make the sampling code print a consolidate output instead of the bulky JSON. The bulkiness is especially
annoying when doing the multiple invocations of the refresh and query and timing the execution time in
materialized/
- DONE Make utility Mongo scripts to load the zips data into Mongo. This is to easily extract state-specific zip code data.
- Maybe undo the common code stuff? The 'incremental/' sub-project has really diverged from the 'materialized/' sub-project and the common code is not as common.