Use Consistent Naming Conventions

Share ideas, strategies, and trends in the crypto database.
Post Reply
Bappy10
Posts: 617
Joined: Sat Dec 21, 2024 3:46 am

Use Consistent Naming Conventions

Post by Bappy10 »

Simple Action: A simple text file (README.txt), a few comments in your code, or even a quick bulleted list in a shared document like Google Docs. For example: "1. Loaded raw_sales.csv. 2. Split 'Items' column by comma. 3. Grouped by 'Customer ID' to count total items."
Version Control Your Code/Scripts (Even if it's just basic copying):

Why: Your "LIST TO DATA" process often involves code or scripts. Changes can introduce errors. list to data Version control allows you to revert to previous working versions.
How: Use a system like Git (even just locally for simple projects). For absolute beginners, just save dated copies of your script files (e.g., transform_v1.py, transform_v2_fix.py).
Simple Action: If using Git, learn git add ., git commit -m "initial transformation". If not, just copy your script file and rename it with a date or version number before making significant changes.
Why: Disorganized file and variable names lead to confusion and lost data.
How: Decide on a standard for filenames (e.g., YYYYMMDD_datasetname_raw.csv, YYYYMMDD_datasetname_processed.csv) and variable names in your code.
Simple Action: Stick to snake_case for variables, meaningful file names, and clearly distinguish raw from processed data.
Store in Accessible and Backed-Up Locations:

Why: Data is useless if you can't find it or if it's lost due to a hard drive crash.
How: Use cloud storage (Google Drive, OneDrive, Dropbox), a shared network drive, or a source control repository. Ensure these locations are backed up.
Simple Action: Save your output files and scripts directly to a cloud-synced folder. If your company has a shared drive, use that.
Validate Your Transformed Data (Spot Checks):
Post Reply