Violin plot: Difference between revisions
Adding short description: "Method of plotting numeric data" (Shortdesc helper) |
+ ROOT framework |
||
Line 14: | Line 14: | ||
Although more informative than box plots, they are less popular. Because of their unpopularity, their meaning can be harder to grasp for many readers not familiar with the violin plot representation. In this case, a more accessible alternative can be plotting a series of stacked histograms or [[Kernel_density_estimation|kernel density distributions]]. |
Although more informative than box plots, they are less popular. Because of their unpopularity, their meaning can be harder to grasp for many readers not familiar with the violin plot representation. In this case, a more accessible alternative can be plotting a series of stacked histograms or [[Kernel_density_estimation|kernel density distributions]]. |
||
Violin plots are available as extensions to a number of software packages, including the [[R (programming language)|R]] packages vioplot, wvioplot, caroline, UsingR, lattice and [[ggplot2]], the [[Stata]] add-on command vioplot,<ref>{{cite journal |last=Hintze |first=Jerry L. |first2=Ray D. |last2=Nelson |title=Violin Plots: A Box Plot-Density Trace Synergism |journal=The American Statistician |volume=52 |issue=2 |pages=181–4 |year=1998 |doi=10.1080/00031305.1998.10480559 |url=http://search.proquest.com/openview/dcd68eb137d2d6b08aa23f37e34e0b01/1?pq-origsite=gscholar}}</ref> and the [[Python_(programming_language) | Python]] libraries [[matplotlib]]<ref>{{cite web |title=violin plots |work=What's new in matplotlib |url=http://matplotlib.org/1.4.0/users/whats_new.html#violin-plots}}</ref>, [[Plotly]]<ref>{{cite web |title=Violin Plots in Python |work=Plotly Python API Library Reference |url=https://plot.ly/python/violin-plot/}}</ref>, and [[Seaborn (software library)|Seaborn]]<ref>{{cite web |first=Michael |last=Waskom |title=Violinplot from a wide-form dataset |date= |work=Seaborn: statistical data visualization |publisher= |url=https://seaborn.pydata.org/examples/wide_form_violinplot.html}}</ref>, a graph type in [[Origin (data analysis software)|Origin]] <ref>{{cite web|url=http://blog.originlab.com/graphing/violin-plot | accessdate=2018-10-29 | title=Violin Plot in Origin 2019|publisher=}}</ref>, [[IGOR Pro]] <ref>{{cite web| url=https://www.wavemetrics.com/igor-8-highlights| accessdate=2019-07-28| title=Igor Pro 8 Highlights| publisher=Wavemetrics}}</ref> |
Violin plots are available as extensions to a number of software packages, including the [[R (programming language)|R]] packages vioplot, wvioplot, caroline, UsingR, lattice and [[ggplot2]], the [[Stata]] add-on command vioplot,<ref>{{cite journal |last=Hintze |first=Jerry L. |first2=Ray D. |last2=Nelson |title=Violin Plots: A Box Plot-Density Trace Synergism |journal=The American Statistician |volume=52 |issue=2 |pages=181–4 |year=1998 |doi=10.1080/00031305.1998.10480559 |url=http://search.proquest.com/openview/dcd68eb137d2d6b08aa23f37e34e0b01/1?pq-origsite=gscholar}}</ref> and the [[Python_(programming_language) | Python]] libraries [[matplotlib]]<ref>{{cite web |title=violin plots |work=What's new in matplotlib |url=http://matplotlib.org/1.4.0/users/whats_new.html#violin-plots}}</ref>, [[Plotly]]<ref>{{cite web |title=Violin Plots in Python |work=Plotly Python API Library Reference |url=https://plot.ly/python/violin-plot/}}</ref>, [[ROOT]]<ref>{{cite web |url=https://root.cern/doc/master/classTHistPainter.html#HP140b |accessdate=2020-05-05 |title=The VIOLIN option}}</ref> and [[Seaborn (software library)|Seaborn]]<ref>{{cite web |first=Michael |last=Waskom |title=Violinplot from a wide-form dataset |date= |work=Seaborn: statistical data visualization |publisher= |url=https://seaborn.pydata.org/examples/wide_form_violinplot.html}}</ref>, a graph type in [[Origin (data analysis software)|Origin]] <ref>{{cite web|url=http://blog.originlab.com/graphing/violin-plot | accessdate=2018-10-29 | title=Violin Plot in Origin 2019|publisher=}}</ref>, [[IGOR Pro]] <ref>{{cite web| url=https://www.wavemetrics.com/igor-8-highlights| accessdate=2019-07-28| title=Igor Pro 8 Highlights| publisher=Wavemetrics}}</ref> and [https://reference.wolfram.com/language/ref/DistributionChart.html DistributionChart] in [[Wolfram Mathematica|Mathematica]]. |
||
==References== |
==References== |
Revision as of 13:32, 5 May 2020
A violin plot is a method of plotting numeric data. It is similar to a box plot, with the addition of a rotated kernel density plot on each side.[1]
Violin plots are similar to box plots, except that they also show the probability density of the data at different values, usually smoothed by a kernel density estimator. Typically a violin plot will include all the data that is in a box plot: a marker for the median of the data; a box or marker indicating the interquartile range; and possibly all sample points, if the number of samples is not too high.
A violin plot is more informative than a plain box plot. While a box plot only shows summary statistics such as mean/median and interquartile ranges, the violin plot shows the full distribution of the data. The difference is particularly useful when the data distribution is multimodal (more than one peak). In this case a violin plot shows the presence of different peaks, their position and relative amplitude.
Like box plots, violin plots are used to represent comparison of a variable distribution (or sample distribution) across different "categories" (for example, temperature distribution compared between day and night, or distribution of car prices compared across different car makers).
A violin plot can have multiple layers. For instance, the outer shape represents all possible results. The next layer inside might represent the values that occur 95% of the time. The next layer (if it exists) inside might represents the values that occur 50% of the time.
Although more informative than box plots, they are less popular. Because of their unpopularity, their meaning can be harder to grasp for many readers not familiar with the violin plot representation. In this case, a more accessible alternative can be plotting a series of stacked histograms or kernel density distributions.
Violin plots are available as extensions to a number of software packages, including the R packages vioplot, wvioplot, caroline, UsingR, lattice and ggplot2, the Stata add-on command vioplot,[2] and the Python libraries matplotlib[3], Plotly[4], ROOT[5] and Seaborn[6], a graph type in Origin [7], IGOR Pro [8] and DistributionChart in Mathematica.
References
- ^ "VIOLIN PLOT". NIST DataPlot. National Institute of Standards and Technology. 2015-10-13.
- ^ Hintze, Jerry L.; Nelson, Ray D. (1998). "Violin Plots: A Box Plot-Density Trace Synergism". The American Statistician. 52 (2): 181–4. doi:10.1080/00031305.1998.10480559.
- ^ "violin plots". What's new in matplotlib.
- ^ "Violin Plots in Python". Plotly Python API Library Reference.
- ^ "The VIOLIN option". Retrieved 2020-05-05.
- ^ Waskom, Michael. "Violinplot from a wide-form dataset". Seaborn: statistical data visualization.
- ^ "Violin Plot in Origin 2019". Retrieved 2018-10-29.
- ^ "Igor Pro 8 Highlights". Wavemetrics. Retrieved 2019-07-28.
External links
- Vioplot add-in for Stata
- Violinplot from a wide-form dataset with the seaborn statistical visualization library based on matplotlib
This article incorporates public domain material from Dataplot reference manual: Violin plot. National Institute of Standards and Technology.