About

DiamondScraper is a simple Python web scraper for BrilliantEarth.com. It scrapes data for both its natural and lab created diamond selection and writes it to a CSV file.

Inspiration

Buying a diamond can be frustrating and expensive.

I built DiamondScraper to create a dataset of natural and lab-created diamonds to demystify the value of the 4 Cs – cut, color, clarity, carat.

Requirements

Firefox browser & geckodriver
pip install gazpacho=1.1
conda install pandas=1.1.3
conda install selenium=3.141.0

Usage

Clone this repo
Move to the DiamondScraper directory
Run scraper.py

There is also a script processing.py to cast categorical data types for a DataFrame.

Attributes

Attribute	Description	Data Type
id	Diamond identification number provided by Brilliant Earth	int
url	URL for the diamond details page	string
shape	External geometric appearance of a diamond	string/categorical
price	Price in U.S. dollars	int
carat	Unit of measurement used to describe the weight of a diamond	float
cut	Facets, symmetry, and reflective qualities of a diamond	string/categorical
color	Natural color or lack of color visible within a diamond, based on the GIA grade scale	string/categorical
clarity	Visibility of natural microscopic inclusions and imperfections within a diamond	string/categorical
report	Diamond certificate or grading report provided by an independent gemology lab	string
type	Natural or lab created diamonds	string
date_fetched	Date the data was fetched	date

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
processing.py		processing.py
requirements.txt		requirements.txt
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Inspiration

Requirements

Usage

Attributes

Meta

About

Languages

License

corralm/diamond-scraper

Folders and files

Latest commit

History

Repository files navigation

About

Inspiration

Requirements

Usage

Attributes

Meta

About

Topics

Resources

License

Stars

Watchers

Forks

Languages