- Fundamentatls
- Matrices & Linear Algebra Fundamentals
- Hash Functions, Binary Tree, O(n)
- Relational Algebra, DB Basics
- Inner, Outer, Cross, Theta Join
- CAP Theorem
- Tabular Data
- Entrophy
- Data Frames & Series
- Sharding
- OLAP (Online analytical processing)
- Multidimensional Data Model
- ETL (Extract, Transform, Load)
- Reporting Vs BI Vs Analytics
- JSON & XML
- NoSQL
- Regex
- Vendor Landscape
- Env Setup
- Statistics
- Pick a Dataset (UCI Repo)
- Descriptive Statistics (mean, median, range, SD, Var)
- Exploratory Data Analysis
- Histograms
- Percentiles & Outliers
- Probability Theory
- Bayes Theorem
- Random Variables
- CDF
- Continuous Distributions (Normal, Poisson, Gaussian)
- Skewness
- ANOVA
- CLR
- Monte Carlo Method
- Hypothesis Testing
- p-Value
- Chi-Square Test
- Estimation
- Confidence Interval
- MLE
- Kernel Density Estimation
- Regression
- Covariance
- Correlation
- Pearson Coeff
- Causation
- Least square Fit
- Euclidean Distance
- Programming
- python basics
- working in excel
- r setup & studi
- r basics
- expressions
- variables
- vectors
- matrices
- arrays
- factors
- lists
- data frames
- reading csv data
- reading raw data
- subsetting data
- manipulate data frames
- functions
- factor analysis
- install pkgs
- ibm spss
- rapid miner
- Machine Learning
- what is ml?
- numerical var
- categorical var
- supervised learning
- unsupervised learning
- concepts, inputs & attributes
- training & test data
- classifier
- prediction
- lift
- overfitting
- bias & variance
- trees & classification
- classifciation rate
- decision trees
- boosting
- naive bayes classifiers
- k-nearest clssifiers
- logistic regression
- ranking
- linear regression
- perceptron
- hierarchical clustering
- k-means clustering
- neural networks
- sentiment analysis
- collaborative filtering
- tagging
- Text Mining / Natural Languate Processing
- vocabulary mapping
- classify text
- using nltk
- using weka
- using mahout
- feature extraction
- market based analysis
- association rules
- support vector machines
- term frequency & weight
- term document matrix
- uima
- text analysis
- named entity recognition
- corpus
- Data Visualization
- data exploraion in R
- Big Data
- Data Ingestion
- Data Munging
- Toolbox
citation
Comment is the energy for a writer, thanks!