Tableau is a powerful data visualization tool, and integrating it with Python can make data science projects more effective. Below is an example data science project that uses Tableau and Python together to analyze and visualize data.
Project Overview:
The goal is to analyze sales data: Python handles the data processing, and Tableau provides the interactive visualizations.
Step 1: Data Preparation and Python Scripting-
Assume you have a sales dataset in a CSV file (sales_data.csv). Use Python to preprocess and clean the data. For simplicity, we focus on a few basic tasks: loading the data, handling missing values, and creating new features.
import pandas as pd
# Load the sales data
sales_data = pd.read_csv('sales_data.csv')
# Clean and preprocess the data
# Example: handle missing values by dropping rows that contain any
sales_data = sales_data.dropna()
# Example: create a new feature for total sales
sales_data['TotalSales'] = sales_data['Quantity'] * sales_data['UnitPrice']
# Save the processed data to a new CSV file
sales_data.to_csv('processed_sales_data.csv', index=False)
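The same preprocessing can be wrapped in a function and tried on a small in-memory table before it touches the real file. This is a minimal sketch: the function name `preprocess_sales`, the `Region` column, and the sample rows are illustrative assumptions; only the `Quantity`, `UnitPrice`, and `TotalSales` column names come from the example above. As a deliberate variation, it drops only rows missing the two columns the calculation needs, rather than rows with any missing value.

```python
import pandas as pd


def preprocess_sales(df: pd.DataFrame) -> pd.DataFrame:
    """Drop rows that cannot produce a total, then add TotalSales."""
    # Assumption: only Quantity and UnitPrice must be present.
    # The original example calls dropna() with no arguments instead.
    cleaned = df.dropna(subset=["Quantity", "UnitPrice"]).copy()
    cleaned["TotalSales"] = cleaned["Quantity"] * cleaned["UnitPrice"]
    return cleaned.reset_index(drop=True)


# Hypothetical sample rows, for illustration only.
sample = pd.DataFrame(
    {
        "Quantity": [2, None, 5],
        "UnitPrice": [10.0, 4.0, 1.5],
        "Region": ["North", "South", None],
    }
)
result = preprocess_sales(sample)
print(result)
```

The row with a missing `Region` is kept here, while a plain `dropna()` would have removed it. Which behavior is right depends on whether the dashboard needs that column.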
Step 2: Connect Tableau to the Processed Data-
- Open Tableau and connect to the processed_sales_data.csv file.
- Do any additional data preparation and cleaning inside Tableau as needed.
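Before connecting, a quick check in Python can catch problems that would otherwise only show up inside Tableau, such as a missing column or leftover nulls. The sketch below is one possible check, not part of Tableau itself; the helper name `find_problems` and the sample table are hypothetical.

```python
import pandas as pd


def find_problems(df: pd.DataFrame, required: list) -> list:
    """Return readable problems; an empty list means the table looks fine."""
    problems = []
    for col in required:
        if col not in df.columns:
            problems.append(f"missing column: {col}")
        elif df[col].isna().any():
            problems.append(f"null values in: {col}")
    return problems


# Hypothetical table standing in for processed_sales_data.csv.
table = pd.DataFrame({"Quantity": [1, None], "UnitPrice": [2.0, 3.0]})
issues = find_problems(table, ["Quantity", "UnitPrice", "TotalSales"])
print(issues)
```

For the real file, the same function could be called on `pd.read_csv('processed_sales_data.csv')`.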
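Step 3: Create Python Calculations in Tableau-
- Python scripts in Tableau run through a TabPy server, so install TabPy (for example with pip install tabpy) and start it first. By default it listens on port 9004.
- In Tableau, open the “Help” menu and choose “Settings and Performance” > “Manage External Service Connection” (named “Manage Analytics Extension Connection” in newer versions).
- Select TabPy as the service and enter the server's host name and port (for example localhost and 9004). Tableau connects to this running server rather than to a Python executable path.
- Create Python calculations in Tableau for advanced analysis. For example, you can write a Python script that calculates a moving average or another statistical measure.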
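SCRIPT_REAL("
import pandas as pd
# _arg1 arrives as a plain list of SUM([TotalSales]) values, not a named column
s = pd.Series(_arg1)
# 3-point moving average; min_periods=1 avoids NaN for the first two rows
return s.rolling(window=3, min_periods=1).mean().tolist()
", SUM([TotalSales]))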
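The moving-average logic can be tried outside Tableau before it is pasted into a calculated field. The sketch below assumes the script receives its argument as a plain list of numbers, which is how TabPy passes `_arg1`; the function name `moving_average` and the sample values are hypothetical. Using `min_periods=1` is a choice made here so that the first rows return a partial average instead of NaN.

```python
import pandas as pd


def moving_average(values, window=3):
    """Rolling mean over a list, one output value per input value."""
    s = pd.Series(values, dtype="float64")
    # min_periods=1: early rows average whatever values are available so far.
    return s.rolling(window=window, min_periods=1).mean().tolist()


# Hypothetical aggregated sales values, for illustration only.
averages = moving_average([10, 20, 30, 40])
print(averages)
```

The function returns a list with the same length as its input, which matters because Tableau expects one result per row in the partition.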
Step 4: Build Tableau Dashboards-
- Build Tableau dashboards by dragging and dropping fields onto the canvas.
- Use the Python calculations in calculated fields so that the results appear in your views.
- Create filters, parameters, and actions to make the dashboard interactive.
Step 5: Publish to Tableau Server or Tableau Online-
- When your Tableau workbook is ready, publish it to Tableau Server or Tableau Online.
- Share the interactive dashboard with stakeholders.
Step 6: Automate Data Updates with Python-
- Use Python to update the data and refresh the Tableau workbook.
- Automate a Python script that fetches new data, preprocesses it, and updates the processed_sales_data.csv file.
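The update step could look roughly like the sketch below. It is a sketch under stated assumptions, not a definitive implementation: the function name `refresh_processed_csv` is hypothetical, "fetching" is reduced to reading a raw CSV from disk, and triggering the refresh on the Tableau side is left out. It writes to a temporary file and then swaps it into place, so Tableau never reads a half-written CSV.

```python
import os
import tempfile

import pandas as pd


def refresh_processed_csv(raw_path: str, out_path: str) -> int:
    """Rebuild the processed CSV from the raw file; return rows written."""
    df = pd.read_csv(raw_path).dropna()
    df["TotalSales"] = df["Quantity"] * df["UnitPrice"]

    # Write next to the target, then replace it in one step.
    out_dir = os.path.dirname(os.path.abspath(out_path))
    fd, tmp_path = tempfile.mkstemp(suffix=".csv", dir=out_dir)
    os.close(fd)
    df.to_csv(tmp_path, index=False)
    os.replace(tmp_path, out_path)
    return len(df)
```

A scheduler such as cron or Windows Task Scheduler can then run a script that calls `refresh_processed_csv('sales_data.csv', 'processed_sales_data.csv')` at a fixed interval.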
This example shows how Python, used for data processing and advanced calculations, integrates with Tableau to produce an interactive and dynamic data science project. Adjust the steps and code to fit your own dataset and analysis goals.