Data Science: About
Data science - a multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured, similar to data mining.
Disciplines involved: information science, computer science, mathematics, and statistics.
Data science is now often used as a buzzword, interchangeably with earlier fields such as business analytics, business intelligence, predictive modeling, and statistics.
Data Science Stages
- Capture Data
  - Data acquisition
  - Data entry
  - Signal reception
  - Data extraction
- Maintain Data
  - Data warehousing
  - Data cleansing
  - Data staging
  - Data processing
  - Data architecture
- Process
  - Data mining
  - Clustering / classification
  - Data modeling
  - Data summarization
- Communicate
  - Data reporting
  - Data visualization
  - Business intelligence
  - Decision making
- Analyze
  - Exploratory / Confirmatory
  - Predictive analysis
  - Regression
  - Text mining
  - Qualitative analysis
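The stages above can be sketched as a tiny pipeline. This is only an illustration, assuming a made-up spend/sales dataset; the function names (capture, maintain, process, analyze, communicate) and the data are hypothetical and use only the Python standard library. Analyze is run before Communicate here because the report needs the fitted result.

```python
import csv
import io
from statistics import mean

# Hypothetical raw input (illustrative data): spend vs. sales, one incomplete row.
RAW_CSV = """spend,sales
1,2
2,4
3,
4,8
"""

def capture(text):
    """Capture: extract rows from a raw source."""
    return list(csv.DictReader(io.StringIO(text)))

def maintain(rows):
    """Maintain: cleanse by dropping incomplete rows and casting to float."""
    return [
        (float(r["spend"]), float(r["sales"]))
        for r in rows
        if r["spend"] and r["sales"]
    ]

def process(pairs):
    """Process: summarize the cleaned data."""
    xs, ys = zip(*pairs)
    return {"n": len(pairs), "mean_x": mean(xs), "mean_y": mean(ys)}

def analyze(pairs, summary):
    """Analyze: least-squares regression line y = slope * x + intercept."""
    mx, my = summary["mean_x"], summary["mean_y"]
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    slope = sxy / sxx
    return slope, my - slope * mx

def communicate(summary, slope, intercept):
    """Communicate: turn the result into a one-line report."""
    return f"{summary['n']} rows; sales ~ {slope:.2f} * spend + {intercept:.2f}"

rows = capture(RAW_CSV)
pairs = maintain(rows)
summary = process(pairs)
slope, intercept = analyze(pairs, summary)
report = communicate(summary, slope, intercept)
```

Each function stands in for a whole stage; in practice every stage would involve far more tooling (warehouses, visualization tools, and so on).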
Three main programming skills involved:
- R
- Python
- SQL
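As a rough illustration of how two of these skills combine, the sketch below runs a SQL aggregation from Python using the standard-library sqlite3 module. The orders table and its rows are hypothetical, made up for this example.

```python
import sqlite3

# In-memory database with a hypothetical `orders` table (illustrative data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("north", 10.0), ("south", 5.0), ("north", 20.0)],
)

# SQL does the aggregation; Python receives the result rows as tuples.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
).fetchall()
conn.close()
```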
Second-tier technology skills involved:
- Apache Hadoop
- NoSQL
- SAS
- AI
- Machine learning
- MATLAB
- Cloud computing
- Apache Spark
- GitHub
- Tableau
- IPython notebooks
- Excel
Data Science Job Titles:
- Data Scientist
- Data Analyst
- Data Engineer
Data skills:
- Data munging - the data wrangling that brings data together into cohesive views, as well as cleaning up the data so that it is polished and ready for use
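
A minimal sketch of data munging, assuming two hypothetical sources (customers and orders) that need to be joined and cleaned. All names and records below are made up for illustration, and only plain Python is used.

```python
# Two hypothetical sources describing the same customers (illustrative data).
customers = [
    {"id": "1", "name": "  alice SMITH "},
    {"id": "2", "name": "Bob Jones"},
    {"id": "2", "name": "Bob Jones"},  # duplicate record
]
orders = [
    {"customer_id": "1", "amount": "19.99"},
    {"customer_id": "2", "amount": "n/a"},  # unparseable value
    {"customer_id": "2", "amount": "5"},
]

def clean_name(raw):
    """Collapse extra whitespace and normalize capitalization."""
    return " ".join(raw.split()).title()

def parse_amount(raw):
    """Return a float, or None when the value cannot be parsed."""
    try:
        return float(raw)
    except ValueError:
        return None

def munge(customers, orders):
    """Join both sources on customer id into one cleaned view."""
    # Keying by id deduplicates customers while their names are cleaned.
    names = {c["id"]: clean_name(c["name"]) for c in customers}
    view = []
    for o in orders:
        amount = parse_amount(o["amount"])
        if amount is None or o["customer_id"] not in names:
            continue  # drop rows that cannot be used
        view.append({"name": names[o["customer_id"]], "amount": amount})
    return view

view = munge(customers, orders)
```

The result is one cohesive list of rows with cleaned names and numeric amounts, with the duplicate customer and the unparseable order removed.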