StatSim → Gen
0.0.4

Generate synthetic data in the browser

Synthetic datasets for testing and exploration

Dataset Type Variables Description
Friedman 1Regression10 + 1y = 10 * sin(Pi * x1 * x2) + 20 * (x3 - 0.5) ** 2 + 10 * x4 + 5 * x5 + e
Friedman 2Regression4 + 1y = sqrt(x1 ** 2 + (x2 * x3 - 1 / (x2 * x4)) ** 2) + e
Friedman 3Regression4 + 1y = atan(x2 * x3 - 1 / (x2 * x4) / x1) + e
PeakRegression10 + 1Peak Benchmark Problem. From: mlbench
HastieClassification10 + 1Binary classification problem used in Hastie et al
MoonsClassification2 + 1Two interleaving half circles
SpiralsClassification2 + 1Two entangled spirals
RingnormClassification10 + 1Breiman, L. (1996). Bias, variance, and arcing classifiers
Based on port and mkdata

Star Issue