Stata

With Stata, choose the reference in statistical software

Fast. Specific. Easy to use. Stata is a complete and integrated software that meets all your data science needs: data manipulation, visualization, statistics and automated reporting.

5 good reasons to use this software

  • Comprehensive statistical software
  • Easy to use and learn
  • Reference software for econometrics, epidemiology and survey processing
  • Rich technical support and documentation
  • Easy and fast processing of large data sets

Very satisfied with the training: acquisition of basic knowledge on STATA. The trainer was clear, showed attentive listening and high pedagogical skills. 

Thank you

Erik Zolotoukhine
CNRS
STATA beginners training

Why choose Stata ?

Stata is a comprehensive software, that includes functionalities for data manipulation, visualization, statistics and automated reports

Stata is the reference for :

  • Econometrics
  • Epidemiology
  • Survey processing
  • Behavioral science
  • Biostatistics
  • Data science
  • Institutional research
  • Medicine
  • Political science
  • Education
  • Finance, business and marketing
  • Public Health
  • Public Policy
  • Sociology

Data Management :

Import/export, Data transformation, ODBC support, Combination of datasets, Specific data management (cross-sectional data: panel data, time series, longitudinal data, slice data …)

Key statistical features:

Contrast, pairwise, comparison and margins, epidemiology, GMM, longitudinal data/panel data, Bayesian analysis, Structural Equation Modelling (SEM), survey processing, treatment effects, time series

Quality graphic output for your publications and reports:

Reproducibility, Testing, Certification (FDA):

  1. Reproducibility between different operating systems and between different editions of Stata
  2. Tested, verified, and certified commands and functions
  3. Certified by the FDA for submission to regulatory agencies

Simple to use and easy to learn even for non programmers

StataCorp Technical Support:

  1. As a user of the current edition of Stata (Stata 17), you are entitled to technical support
  2. Fast, clear, precise and detailed answers
  3. Answers by researchers and business experts (master, PHD)

Documentation, resource sharing, user community:

  • 33 manuals with over 17,000 pages
  • User forum, Stata researchers’ blog, community-contributed commands
  • Stata Journal: class A journal
  • International Stata conferences

A complete package, without additional modules

Compatible with any platform (Windows, Mac, Linux)

Fast and adapted to the power of the machines and the volume of data to be processed (Stata BE/SE/MP):

  1. Stata/BE: Basic Edition; for medium-sized datasets
  2. Stata/SE: Standard Edition; for larger datasets
  3. Stata/MP: Multi-Processors ; the fastest edition of Stata (for dual-core, quad-core and multi-core/multi-processor computers) that can analyze the largest datasets

Affordable pricing

Stata is sold with all modules in one package !

Stata offers several purchase options to fit your budget. You can choose an annual license to ensure you always have access to the latest features, or you can choose a perpetual license. Contact a sales representative or browse our products to learn more about our affordable options.

SoftwareMax. number of
variables
Max. number of variables
right-hand
Max. number of
observations
Edition
64-bit
available
Design
for parallel
parallel computing?
Plateforms
Stata/MP120 00065 53220 billion*YesYesWin, Mac
(64-bits)
or Unix
Stata/SE32 76710 9982,14 billion*YesNoWin, Mac
or Unix
Stata/BE2 0487982,14 billion*
YesNoWin, Mac
or Unix

Features

Data management

Data transformations, match-merge, data import/export, ODBC, SQL, XML, group processing, adding files, sorting, row-column transposition, labeling, and saving results…

Basic statistical features

Summaries, cross-tabulations, correlations, t-tests, equality of variance tests, proportion tests, confidence intervals, factorial variables…

Graphics

Curves, scatterplots, histograms, pie charts, hi-lo graphs, contour plots, graph editor, diagnostic regression plots, non-parametric smoothers, Q-Q graph distribution…

Graphical user interface

Results window, Command window, Examination window, Data editor, Head of variables, Ne-file editor, Variable properties, Viewer, Clipboard preview tool, menus/dialogs for all commands, multiple preference sets…

Longitudinal / panel data

Random and fixed effects with robust standard errors, linear mixed models, random effects probit, GEE, random and fixed Poisson effects, dynamic panel data models, and instrumental variables regression models; AR (1) perturbations…

Time series

ARIMA, ARFIMA, ARCH / GARCH, VAR, VECM, multivariate GARCH, unobserved component models, space-state models, business schedules, forecasts, impulse-response functions, unit root tests, and smoothers

Multilevel mixed effects models

Continuous and binary outcomes; cross-random effects; ML and REML estimation; residual error structures, support for survey data in multilevel linear models…

Survey Methods

Bootstrap, BRR, jackknife, linearized, and SDR variance estimation; poststratification; DEFF; predictive margins; means, proportions, ratios, totals; summary tables; instrumental variables, probit, Cox regression

ANOVA / MANOVA

Symmetric and asymmetric designs; factorial, nested,
and mixed designs; repeated measures; marginal means; contrasts…

Survival analysis

Kaplan-Meier and Nelson-Aalen estimators, Cox regression (fragility), parametric models (fragility); competing risks; covariates; left and right censoring, Weibull, exponential, Gompertz analysis; sample size and power analysis…

SEM (Structural Equation Modeling)

Graphical model builder, standardized and non-standardized estimates, modification indices, direct and indirect effects, path diagrams, factor scores and other predictions, estimates with groups and invariance tests, goodness of fit, MAR data manipulation by FIML, survey data, pooled data

Tests, predictions and effects

Wald tests, LR tests; linear and non-linear combinations, generalized forecasts and predictions, marginal means, least squares means, adjusted means; marginal and partial effects; Hausman tests…

Multivariate methods

Factor analysis, principal components, discriminant analysis, rotation, multidimensional scaling, Procrustes analysis, correspondence analysis, biplots, dendrograms…

Contrasts and pairwise comparisons

Compare means, intercepts, or slopes, comparison of adjacent categories; compare with reference or grand mean category; orthogonal polynomials, adjustment for multiple comparisons; treatment effects, graphical effects and potential outcomes…

Generalized Linear Models (GLM)

Ten link functions, user-defined links, seven distributions, ML and IRLS estimation, nine variance estimators, seven residuals

Epidemiology

Rate normalization, case-control, cohort, matched case-control, Mantel-Haenszel, pharmacokinetics, ROC analysis, ICD-9-CM…

Non-parametric methods

Wilcoxon-Mann-Whitney ranks, Wilcoxon, and Kruskal-Wallis tests; Spearman and Kendall correlations; Kolmogorov-Smirnov tests; exact binomial CI; survival data, ROC analysis; smoothing; bootstrapping…

Other statistical methods

Sample size and power, kappa measure of agreement between two coders, Cronbach’s alpha, mathematical and statistical functions, and much more.

Exact statistics

Logistic and exact Poisson regression, exact case-control statistics, binomial tests, Fisher’s exact test for r × c tables…

Non-linear regression GMM

Generalized method of moments (GMM), non linear regression…

Resampling and simulation methods

Bootstrap, jackknife, and Monte Carlo simulation, permutation tests…

Maximum likelihood

User-specified functions; NR, DFP, BFGS, BHHH; OIM, OPG, robust, bootstrap, and jackknife matrices; Wald tests
Wald tests; survey data; numerical or analytical derivatives…

Other statistical methods

Sample size and power, kappa measure of agreement between two coders, Cronbach’s alpha, mathematical and statistical functions, …

Programming language

Added new commands, scripting command, if, while, parsing command, debugging, menu and programming dialog box, markup and control language…

Installation qualification

IQ report for regulatory agencies such as the FDA, verification of the accessibility of the facility for people with disabilities.

Commands provided by the community

For meta-analysis, data management, survival, econometrics

Internet capacity

Possibility to install new commands, web updates, web file sharing, latest news from Stata, and much more…

Stata 17 new features

A faster Stata

Stata values accuracy and speed. There is often a trade-off between the two and Stata strives to give users the best of both worlds.

The algorithms behind sort and collapse have been updated to make these commands faster. The speed of some estimation commands such as mixed, which fits multilevel mixed effects models, has also been improved.

Tables

Better tables were requested by users. And here they are !

You can easily create tables that compare regression results or summary statistics, you can create styles and apply them to any table you create, and you can export your tables to Word, PDF, HTML, LaTeX, Excel or Markdown and include them in reports.

The table command has been redesigned. The new Collect prefix gathers as many results from as many commands as you want, builds tables, exports them in many formats, and more !

You can also point and click to create tables using the new Table Builder.

Bayesian Econometrics

Stata 17 now does Bayesian econometrics !

You want to use probabilistic statements to answer economic questions, for example: Are people who participate in a job training program more likely to stay employed over the next five years? Do you want to incorporate prior knowledge about an economic process?

The new Bayesian econometrics features in Stata can help you !

Add many Bayesian models such as cross-sectional, panel data, multilevel and time series models. Compare models using Bayes factors. Get predictions and forecasts. And much more!

Pystata and Jupyter Notebook with Stata

What is PyStata?

  • It’s all about the ways Stata and Python can interact.
  • You could already call Python code from Stata, now you can call Stata (and Mata) from Python via a new Python package pystata!
  • This new package can be used in conjunction with the existing Stata Function Interface (SFI) module, making it even easier for Stata and Python to interact.
  • PyStata also allows you to access Stata from a Jupyter notebook.

Configuration

Operating System

Operating SystemVersion
Microsoft WindowsWindows 10 *
Windows 8 *
Windows 7 *
Windows Server 2019, 2016, 2012, 2008, 2003 *
Apple MacOSStata on macOS requires a 64-bit Intel® processor (Core™2 Duo or higher)
Stata requires Mac OS 10.11 or later
LinuxAny 64-bit architecture (x86-64 or compatible)
For xstata, you will need to install GTK 2.24
(*) Windows 64-bit

Configuration mémoire et espace disque

PackageRAMDisk space
Stata/MP
fast and powerful
4 Go1 Go
Stata/SE
large data sets
2 Go1 Go
Stata/BE
basic edition
1 Go1 Go

Stata on Unix requires a graphics card that can display thousands of colors or more (16-bit or 24-bit colors)

Starting from
Prices
Subscription
595 € / year
Student subscription
94 € / year