Skip to main content

SAH Analytic Capabilities

SAH has the statistical package R embedded, but can be used with any statistical package. SAH has text mining, time series, graph anaylsis capabilities and has interoperability with major cloud provider tools.

Classification Analysis

  • CART
  • C4.5 Decision Tree Analysis
  • CHAID Decision Tree Analysis
  • K Nearest Neighbour
  • Logisitic Regression Elastic Net
  • Back-Propagation (Neural Network)
  • Naive Bayes
  • Support Vector Machine
  • Random Forests
  • Confusion Matrix
  • Area Under Curve (AUC)
  • Parameter Selection/Model Evaluation

Regression

  • Multiple Linear Regression Elastic Net
  • Polynomial Regression
  • Exponential Regression
  • Bi-Variate Geometric Regression
  • Bi-Variate Logarithmic Regression

Cluster Analysis

  • ABC Classification
  • DBSCAN
  • K-Means
  • K-Medoid Clustering
  • K-Medians
  • Kohonen Self Organized Maps
  • Agglomerate Hierarchical
  • Affinity Propagation
  • Latent Dirichlet Allocation (LDA)
  • Gaussian Mixture Model (GMM)
  • Cluster Assignment

Time Series Analysis

  • Single/Double/Triple Exponential Smoothing
  • Forecast Smoothing
  • ARIMA/Seasonal ARIMA
  • Brown Exponential Smoothing
  • Croston Method
  • Forecast Accuracy Measure
  • Linear Regression with Damped Trend and Seasonal Adjust
  • Test for White Noise, Trend, Seasonality

Association Analysis

  • Apriori
  • Apriori Lite
  • FP-Growth
  • KORD - Top K Rule Discovery

Probability Distribution

  • Distribution Fit/Weibull analysis
  • Cumulative Distribution Function
  • Quantile Function
  • Kaplan-Meier Survival Analysis

Outlier Detection

  • Inter-Quartile Range Test (Turkey's Test)
  • Variance Test
  • Anomaly Detection
  • Grubbs Outlier Test

Link Prediction

  • Common Neighbors
  • Jaccard's Coefficient
  • Adamic/Adar
  • Katzβ

Statistic Functions

  • Mean, Median, Variance, Standard Deviation, Kurtosis, Skewness
  • Covariance Matrix
  • Pearson Correlations Matrix
  • Chi-squared Tests:
    • Test of Quality of Fit
    • Test of Independence
  • F-test (variance of equal test)

Data Preparation

  • Sampling
  • Binning
  • Scaling
  • Partitioning
  • Principal Component Analysis (PCA)/PCA Projection

Other

  • Weighted Scores Table
  • Substitute Missing Values