Overview
Dataset statistics
| Number of variables | 5 |
|---|---|
| Number of observations | 150 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2 |
| Duplicate rows (%) | 1.3% |
| Total size in memory | 6.0 KiB |
| Average record size in memory | 40.9 B |
Variable types
| Numeric | 4 |
|---|---|
| Categorical | 1 |
Alerts
| Dataset has 2 (1.3%) duplicate rows | Duplicates |
|---|---|
| petal_length is highly overall correlated with petal_width and 2 other fields | High correlation |
| petal_width is highly overall correlated with petal_length and 2 other fields | High correlation |
| sepal_length is highly overall correlated with petal_length and 2 other fields | High correlation |
| species is highly overall correlated with petal_length and 2 other fields | High correlation |
| species is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2025-03-13 15:29:57.582949 |
|---|---|
| Analysis finished | 2025-03-13 15:29:59.043868 |
| Duration | 1.46 seconds |
| Software version | ydata-profiling v4.14.0 |
| Download configuration | config.json |
Variables
sepal_length
Real number (ℝ)
High correlation 
| Distinct | 35 |
|---|---|
| Distinct (%) | 23.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8433333 |
| Minimum | 4.3 |
| Maximum | 7.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.3 KiB |
Quantile statistics
| Minimum | 4.3 |
|---|---|
| 5-th percentile | 4.6 |
| Q1 | 5.1 |
| median | 5.8 |
| Q3 | 6.4 |
| 95-th percentile | 7.255 |
| Maximum | 7.9 |
| Range | 3.6 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 0.82806613 |
|---|---|
| Coefficient of variation (CV) | 0.14171126 |
| Kurtosis | -0.55206404 |
| Mean | 5.8433333 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 0.31491096 |
| Sum | 876.5 |
| Variance | 0.68569351 |
| Monotonicity | Not monotonic |
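Several of the sepal_length statistics above are derived from one another, which allows a quick consistency check. The sketch below recomputes them from the reported values using only the Python standard library; the variable names are illustrative and not part of ydata-profiling.

```python
# Values copied from the sepal_length tables in this report.
n = 150
mean, std = 5.8433333, 0.82806613
minimum, maximum = 4.3, 7.9
q1, q3 = 5.1, 6.4

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
total = mean * n                 # sum of all observations

print(f"CV={cv:.8f} variance={variance:.8f}")
print(f"range={value_range:.1f} IQR={iqr:.1f} sum={total:.1f}")
```

Small differences in the last printed digit are expected, because the inputs are themselves rounded in the report.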
Histogram with fixed size bins (bins=35)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 10 | 6.7% |
| 5.1 | 9 | 6.0% |
| 6.3 | 9 | 6.0% |
| 5.7 | 8 | 5.3% |
| 6.7 | 8 | 5.3% |
| 5.8 | 7 | 4.7% |
| 5.5 | 7 | 4.7% |
| 6.4 | 7 | 4.7% |
| 4.9 | 6 | 4.0% |
| 5.4 | 6 | 4.0% |
| Other values (25) | 73 | 48.7% |
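Each Frequency (%) entry in these value tables is the count divided by the 150 observations. A minimal sketch of that arithmetic follows; the helper name `frequency_pct` is hypothetical and not a ydata-profiling function.

```python
def frequency_pct(count: int, total: int = 150) -> str:
    """Format a count as a share of all observations, to one decimal place."""
    return f"{count / total:.1%}"

# Counts taken from the sepal_length value table.
for value, count in [(5, 10), (5.1, 9), (4.9, 6)]:
    print(value, count, frequency_pct(count))
```

The same helper can be used to recover a percentage for any row where only the count is shown.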
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4.3 | 1 | 0.7% |
| 4.4 | 3 | 2.0% |
| 4.5 | 1 | 0.7% |
| 4.6 | 4 | 2.7% |
| 4.7 | 2 | 1.3% |
| 4.8 | 5 | 3.3% |
| 4.9 | 6 | 4.0% |
| 5 | 10 | 6.7% |
| 5.1 | 9 | 6.0% |
| 5.2 | 4 | 2.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 7.9 | 1 | 0.7% |
| 7.7 | 4 | 2.7% |
| 7.6 | 1 | 0.7% |
| 7.4 | 1 | 0.7% |
| 7.3 | 1 | 0.7% |
| 7.2 | 3 | 2.0% |
| 7.1 | 1 | 0.7% |
| 7 | 1 | 0.7% |
| 6.9 | 4 | 2.7% |
| 6.8 | 3 | 2.0% |
sepal_width
Real number (ℝ)
| Distinct | 23 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.054 |
| Minimum | 2 |
| Maximum | 4.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.3 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2.345 |
| Q1 | 2.8 |
| median | 3 |
| Q3 | 3.3 |
| 95-th percentile | 3.8 |
| Maximum | 4.4 |
| Range | 2.4 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.43359431 |
|---|---|
| Coefficient of variation (CV) | 0.14197587 |
| Kurtosis | 0.29078106 |
| Mean | 3.054 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 0.33405266 |
| Sum | 458.1 |
| Variance | 0.18800403 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=23)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 26 | 17.3% |
| 2.8 | 14 | 9.3% |
| 3.2 | 13 | 8.7% |
| 3.1 | 12 | 8.0% |
| 3.4 | 12 | 8.0% |
| 2.9 | 10 | 6.7% |
| 2.7 | 9 | 6.0% |
| 2.5 | 8 | 5.3% |
| 3.5 | 6 | 4.0% |
| 3.3 | 6 | 4.0% |
| Other values (13) | 34 | 22.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1 | 0.7% |
| 2.2 | 3 | 2.0% |
| 2.3 | 4 | 2.7% |
| 2.4 | 3 | 2.0% |
| 2.5 | 8 | 5.3% |
| 2.6 | 5 | 3.3% |
| 2.7 | 9 | 6.0% |
| 2.8 | 14 | 9.3% |
| 2.9 | 10 | 6.7% |
| 3 | 26 | 17.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4.4 | 1 | 0.7% |
| 4.2 | 1 | 0.7% |
| 4.1 | 1 | 0.7% |
| 4 | 1 | 0.7% |
| 3.9 | 2 | 1.3% |
| 3.8 | 6 | 4.0% |
| 3.7 | 3 | 2.0% |
| 3.6 | 3 | 2.0% |
| 3.5 | 6 | 4.0% |
| 3.4 | 12 | 8.0% |
petal_length
Real number (ℝ)
High correlation 
| Distinct | 43 |
|---|---|
| Distinct (%) | 28.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7586667 |
| Minimum | 1 |
| Maximum | 6.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.3 |
| Q1 | 1.6 |
| median | 4.35 |
| Q3 | 5.1 |
| 95-th percentile | 6.1 |
| Maximum | 6.9 |
| Range | 5.9 |
| Interquartile range (IQR) | 3.5 |
Descriptive statistics
| Standard deviation | 1.7644204 |
|---|---|
| Coefficient of variation (CV) | 0.46942721 |
| Kurtosis | -1.4019208 |
| Mean | 3.7586667 |
| Median Absolute Deviation (MAD) | 1.25 |
| Skewness | -0.27446425 |
| Sum | 563.8 |
| Variance | 3.1131794 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=43)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.5 | 14 | 9.3% |
| 1.4 | 12 | 8.0% |
| 5.1 | 8 | 5.3% |
| 4.5 | 8 | 5.3% |
| 1.6 | 7 | 4.7% |
| 1.3 | 7 | 4.7% |
| 5.6 | 6 | 4.0% |
| 4.7 | 5 | 3.3% |
| 4.9 | 5 | 3.3% |
| 4 | 5 | 3.3% |
| Other values (33) | 73 | 48.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | 0.7% |
| 1.1 | 1 | 0.7% |
| 1.2 | 2 | 1.3% |
| 1.3 | 7 | 4.7% |
| 1.4 | 12 | 8.0% |
| 1.5 | 14 | 9.3% |
| 1.6 | 7 | 4.7% |
| 1.7 | 4 | 2.7% |
| 1.9 | 2 | 1.3% |
| 3 | 1 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6.9 | 1 | 0.7% |
| 6.7 | 2 | 1.3% |
| 6.6 | 1 | 0.7% |
| 6.4 | 1 | 0.7% |
| 6.3 | 1 | 0.7% |
| 6.1 | 3 | 2.0% |
| 6 | 2 | 1.3% |
| 5.9 | 2 | 1.3% |
| 5.8 | 3 | 2.0% |
| 5.7 | 3 | 2.0% |
petal_width
Real number (ℝ)
High correlation 
| Distinct | 22 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1986667 |
| Minimum | 0.1 |
| Maximum | 2.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.3 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.2 |
| Q1 | 0.3 |
| median | 1.3 |
| Q3 | 1.8 |
| 95-th percentile | 2.3 |
| Maximum | 2.5 |
| Range | 2.4 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 0.76316074 |
|---|---|
| Coefficient of variation (CV) | 0.6366747 |
| Kurtosis | -1.3397542 |
| Mean | 1.1986667 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.10499656 |
| Sum | 179.8 |
| Variance | 0.58241432 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=22)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.2 | 28 | 18.7% |
| 1.3 | 13 | 8.7% |
| 1.8 | 12 | 8.0% |
| 1.5 | 12 | 8.0% |
| 1.4 | 8 | 5.3% |
| 2.3 | 8 | 5.3% |
| 1 | 7 | 4.7% |
| 0.4 | 7 | 4.7% |
| 0.3 | 7 | 4.7% |
| 0.1 | 6 | 4.0% |
| Other values (12) | 42 | 28.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1 | 6 | 4.0% |
| 0.2 | 28 | 18.7% |
| 0.3 | 7 | 4.7% |
| 0.4 | 7 | 4.7% |
| 0.5 | 1 | 0.7% |
| 0.6 | 1 | 0.7% |
| 1 | 7 | 4.7% |
| 1.1 | 3 | 2.0% |
| 1.2 | 5 | 3.3% |
| 1.3 | 13 | 8.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.5 | 3 | 2.0% |
| 2.4 | 3 | 2.0% |
| 2.3 | 8 | 5.3% |
| 2.2 | 3 | 2.0% |
| 2.1 | 6 | 4.0% |
| 2 | 6 | 4.0% |
| 1.9 | 5 | 3.3% |
| 1.8 | 12 | 8.0% |
| 1.7 | 2 | 1.3% |
| 1.6 | 4 | 2.7% |
species
Categorical
High correlation  Uniform 
| Distinct | 3 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 KiB |
| Iris-setosa | 50 |
|---|---|
| Iris-versicolor | 50 |
| Iris-virginica | 50 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 13.333333 |
| Min length | 11 |
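The length statistics above follow directly from the three category names, each of which occurs 50 times (see Common Values). A minimal sketch using the standard `statistics` module, with illustrative variable names:

```python
from statistics import mean, median

# The three categories, each observed 50 times in this dataset.
categories = ["Iris-setosa", "Iris-versicolor", "Iris-virginica"]
lengths = [len(name) for name in categories for _ in range(50)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 6),
    "min": min(lengths),
}
print(length_stats)
```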
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Iris-setosa |
|---|---|
| 2nd row | Iris-setosa |
| 3rd row | Iris-setosa |
| 4th row | Iris-setosa |
| 5th row | Iris-setosa |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Iris-setosa | 50 | 33.3% |
| Iris-versicolor | 50 | 33.3% |
| Iris-virginica | 50 | 33.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| iris-setosa | 50 | 33.3% |
| iris-versicolor | 50 | 33.3% |
| iris-virginica | 50 | 33.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 350 | 17.5% |
| r | 300 | 15.0% |
| s | 300 | 15.0% |
| I | 150 | 7.5% |
| - | 150 | 7.5% |
| o | 150 | 7.5% |
| e | 100 | 5.0% |
| a | 100 | 5.0% |
| v | 100 | 5.0% |
| c | 100 | 5.0% |
| Other values (4) | 200 | 10.0% |
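The character counts above can be reproduced from the category names alone, since each name occurs 50 times. The sketch below does this with `collections.Counter`; it is an illustration of the arithmetic, not ydata-profiling's own implementation.

```python
from collections import Counter

categories = ["Iris-setosa", "Iris-versicolor", "Iris-virginica"]
rows_per_category = 50  # each species appears 50 times

# Count characters once per name, then scale by the number of rows.
char_counts = Counter()
for name in categories:
    for ch, k in Counter(name).items():
        char_counts[ch] += k * rows_per_category

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common(6):
    print(f"{ch!r}: {count} ({count / total_chars:.1%})")
```

Counting is case-sensitive, which is why `I` and `i` appear as separate rows in the table.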
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2000 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| i | 350 | 17.5% |
| r | 300 | 15.0% |
| s | 300 | 15.0% |
| I | 150 | 7.5% |
| - | 150 | 7.5% |
| o | 150 | 7.5% |
| e | 100 | 5.0% |
| a | 100 | 5.0% |
| v | 100 | 5.0% |
| c | 100 | 5.0% |
| Other values (4) | 200 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2000 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| i | 350 | 17.5% |
| r | 300 | 15.0% |
| s | 300 | 15.0% |
| I | 150 | 7.5% |
| - | 150 | 7.5% |
| o | 150 | 7.5% |
| e | 100 | 5.0% |
| a | 100 | 5.0% |
| v | 100 | 5.0% |
| c | 100 | 5.0% |
| Other values (4) | 200 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2000 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| i | 350 | 17.5% |
| r | 300 | 15.0% |
| s | 300 | 15.0% |
| I | 150 | 7.5% |
| - | 150 | 7.5% |
| o | 150 | 7.5% |
| e | 100 | 5.0% |
| a | 100 | 5.0% |
| v | 100 | 5.0% |
| c | 100 | 5.0% |
| Other values (4) | 200 | 10.0% |
Interactions
Correlations
| | petal_length | petal_width | sepal_length | sepal_width | species |
|---|---|---|---|---|---|
| petal_length | 1.000 | 0.936 | 0.881 | -0.303 | 0.890 |
| petal_width | 0.936 | 1.000 | 0.834 | -0.278 | 0.924 |
| sepal_length | 0.881 | 0.834 | 1.000 | -0.159 | 0.617 |
| sepal_width | -0.303 | -0.278 | -0.159 | 1.000 | 0.437 |
| species | 0.890 | 0.924 | 0.617 | 0.437 | 1.000 |
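The matrix above can be related to the High correlation alerts in the Overview by listing, for each variable, the other variables whose absolute correlation reaches a cutoff. The sketch below assumes a cutoff of 0.5; that value is not stated anywhere in this report and is chosen here only because it reproduces the alerts shown.

```python
# Correlation matrix copied from the table above.
names = ["petal_length", "petal_width", "sepal_length", "sepal_width", "species"]
matrix = [
    [1.000, 0.936, 0.881, -0.303, 0.890],
    [0.936, 1.000, 0.834, -0.278, 0.924],
    [0.881, 0.834, 1.000, -0.159, 0.617],
    [-0.303, -0.278, -0.159, 1.000, 0.437],
    [0.890, 0.924, 0.617, 0.437, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, not taken from this report

# For each variable, collect the other variables at or above the cutoff.
high = {
    name: [
        other
        for other, r in zip(names, row)
        if other != name and abs(r) >= THRESHOLD
    ]
    for name, row in zip(names, matrix)
}
for name, partners in high.items():
    print(name, "->", partners or "none")
```

Under this assumption every variable except sepal_width has three partners, which matches the "and 2 other fields" wording of the four alerts.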
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
Sample
| | sepal_length | sepal_width | petal_length | petal_width | species |
|---|---|---|---|---|---|
| 0 | 5.1 | 3.5 | 1.4 | 0.2 | Iris-setosa |
| 1 | 4.9 | 3.0 | 1.4 | 0.2 | Iris-setosa |
| 2 | 4.7 | 3.2 | 1.3 | 0.2 | Iris-setosa |
| 3 | 4.6 | 3.1 | 1.5 | 0.2 | Iris-setosa |
| 4 | 5.0 | 3.6 | 1.4 | 0.2 | Iris-setosa |
| 5 | 5.4 | 3.9 | 1.7 | 0.4 | Iris-setosa |
| 6 | 4.6 | 3.4 | 1.4 | 0.3 | Iris-setosa |
| 7 | 5.0 | 3.4 | 1.5 | 0.2 | Iris-setosa |
| 8 | 4.4 | 2.9 | 1.4 | 0.2 | Iris-setosa |
| 9 | 4.9 | 3.1 | 1.5 | 0.1 | Iris-setosa |
| | sepal_length | sepal_width | petal_length | petal_width | species |
|---|---|---|---|---|---|
| 140 | 6.7 | 3.1 | 5.6 | 2.4 | Iris-virginica |
| 141 | 6.9 | 3.1 | 5.1 | 2.3 | Iris-virginica |
| 142 | 5.8 | 2.7 | 5.1 | 1.9 | Iris-virginica |
| 143 | 6.8 | 3.2 | 5.9 | 2.3 | Iris-virginica |
| 144 | 6.7 | 3.3 | 5.7 | 2.5 | Iris-virginica |
| 145 | 6.7 | 3.0 | 5.2 | 2.3 | Iris-virginica |
| 146 | 6.3 | 2.5 | 5.0 | 1.9 | Iris-virginica |
| 147 | 6.5 | 3.0 | 5.2 | 2.0 | Iris-virginica |
| 148 | 6.2 | 3.4 | 5.4 | 2.3 | Iris-virginica |
| 149 | 5.9 | 3.0 | 5.1 | 1.8 | Iris-virginica |
Duplicate rows
Most frequently occurring
| | sepal_length | sepal_width | petal_length | petal_width | species | # duplicates |
|---|---|---|---|---|---|---|
| 0 | 4.9 | 3.1 | 1.5 | 0.1 | Iris-setosa | 3 |
| 1 | 5.8 | 2.7 | 5.1 | 1.9 | Iris-virginica | 2 |
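A table like the one above can be produced by counting identical rows. The sketch below uses toy data built from the two duplicated rows listed here plus one unique row from the sample; the function name `duplicate_groups` is hypothetical and not part of ydata-profiling.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Toy data: each duplicated row repeated as often as the "# duplicates"
# column indicates, followed by one row that occurs only once.
rows = (
    [(4.9, 3.1, 1.5, 0.1, "Iris-setosa")] * 3
    + [(5.8, 2.7, 5.1, 1.9, "Iris-virginica")] * 2
    + [(5.1, 3.5, 1.4, 0.2, "Iris-setosa")]
)
groups = duplicate_groups(rows)
print(len(groups), groups)
```

The two groups found here line up with the two entries in the table, which suggests the "Duplicate rows" figure in the Overview counts distinct duplicated rows rather than every repeated occurrence.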