Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 344 |
Missing cells | 19 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 24.3 KiB |
Average record size in memory | 72.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 4 |
rowid is highly overall correlated with bill_length_mm and 3 other fields | High correlation |
bill_length_mm is highly overall correlated with rowid and 4 other fields | High correlation |
bill_depth_mm is highly overall correlated with flipper_length_mm and 2 other fields | High correlation |
flipper_length_mm is highly overall correlated with bill_length_mm and 4 other fields | High correlation |
body_mass_g is highly overall correlated with bill_length_mm and 3 other fields | High correlation |
species is highly overall correlated with rowid and 5 other fields | High correlation |
island is highly overall correlated with rowid and 2 other fields | High correlation |
sex is highly overall correlated with bill_length_mm and 2 other fields | High correlation |
year is highly overall correlated with rowid | High correlation |
sex has 11 (3.2%) missing values | Missing |
rowid is uniformly distributed | Uniform |
rowid has unique values | Unique |
Reproduction
Analysis started | 2023-11-17 19:32:31.787172 |
---|---|
Analysis finished | 2023-11-17 19:32:40.846371 |
Duration | 9.06 seconds |
Software version | ydata-profiling vv4.6.1 |
Download configuration | config.json |
rowid
Real number (ℝ)
HIGH CORRELATION
  UNIFORM
  UNIQUE
 
Distinct | 344 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 172.5 |
Minimum | 1 |
---|---|
Maximum | 344 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 18.15 |
Q1 | 86.75 |
median | 172.5 |
Q3 | 258.25 |
95-th percentile | 326.85 |
Maximum | 344 |
Range | 343 |
Interquartile range (IQR) | 171.5 |
Descriptive statistics
Standard deviation | 99.448479 |
---|---|
Coefficient of variation (CV) | 0.57651292 |
Kurtosis | -1.2 |
Mean | 172.5 |
Median Absolute Deviation (MAD) | 86 |
Skewness | 0 |
Sum | 59340 |
Variance | 9890 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
227 | 1 | 0.3% |
235 | 1 | 0.3% |
234 | 1 | 0.3% |
233 | 1 | 0.3% |
232 | 1 | 0.3% |
231 | 1 | 0.3% |
230 | 1 | 0.3% |
229 | 1 | 0.3% |
228 | 1 | 0.3% |
Other values (334) | 334 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
344 | 1 | |
343 | 1 | |
342 | 1 | |
341 | 1 | |
340 | 1 | |
339 | 1 | |
338 | 1 | |
337 | 1 | |
336 | 1 | |
335 | 1 |
species
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 KiB |
Adelie | |
---|---|
Gentoo | |
Chinstrap |
Common Values
Value | Count | Frequency (%) |
Adelie | 152 | |
Gentoo | 124 | |
Chinstrap | 68 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
adelie | 152 | |
gentoo | 124 | |
chinstrap | 68 |
Most occurring characters
Value | Count | Frequency (%) |
e | 428 | |
o | 248 | |
i | 220 | |
n | 192 | |
t | 192 | |
A | 152 | 6.7% |
d | 152 | 6.7% |
l | 152 | 6.7% |
G | 124 | 5.5% |
C | 68 | 3.0% |
Other values (5) | 340 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1924 | |
Uppercase Letter | 344 | 15.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 428 | |
o | 248 | |
i | 220 | |
n | 192 | |
t | 192 | |
d | 152 | 7.9% |
l | 152 | 7.9% |
h | 68 | 3.5% |
s | 68 | 3.5% |
r | 68 | 3.5% |
Other values (2) | 136 | 7.1% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 152 | |
G | 124 | |
C | 68 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2268 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 428 | |
o | 248 | |
i | 220 | |
n | 192 | |
t | 192 | |
A | 152 | 6.7% |
d | 152 | 6.7% |
l | 152 | 6.7% |
G | 124 | 5.5% |
C | 68 | 3.0% |
Other values (5) | 340 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2268 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 428 | |
o | 248 | |
i | 220 | |
n | 192 | |
t | 192 | |
A | 152 | 6.7% |
d | 152 | 6.7% |
l | 152 | 6.7% |
G | 124 | 5.5% |
C | 68 | 3.0% |
Other values (5) | 340 |
island
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 KiB |
Biscoe | |
---|---|
Dream | |
Torgersen |
Common Values
Value | Count | Frequency (%) |
Biscoe | 168 | |
Dream | 124 | |
Torgersen | 52 | 15.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
biscoe | 168 | |
dream | 124 | |
torgersen | 52 | 15.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 396 | |
r | 228 | |
s | 220 | |
o | 220 | |
B | 168 | |
i | 168 | |
c | 168 | |
D | 124 | 5.9% |
a | 124 | 5.9% |
m | 124 | 5.9% |
Other values (3) | 156 | 7.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1752 | |
Uppercase Letter | 344 | 16.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 396 | |
r | 228 | |
s | 220 | |
o | 220 | |
i | 168 | |
c | 168 | |
a | 124 | 7.1% |
m | 124 | 7.1% |
g | 52 | 3.0% |
n | 52 | 3.0% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 168 | |
D | 124 | |
T | 52 | 15.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2096 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 396 | |
r | 228 | |
s | 220 | |
o | 220 | |
B | 168 | |
i | 168 | |
c | 168 | |
D | 124 | 5.9% |
a | 124 | 5.9% |
m | 124 | 5.9% |
Other values (3) | 156 | 7.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2096 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 396 | |
r | 228 | |
s | 220 | |
o | 220 | |
B | 168 | |
i | 168 | |
c | 168 | |
D | 124 | 5.9% |
a | 124 | 5.9% |
m | 124 | 5.9% |
Other values (3) | 156 | 7.4% |
bill_length_mm
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 164 |
---|---|
Distinct (%) | 48.0% |
Missing | 2 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.92193 |
Minimum | 32.1 |
---|---|
Maximum | 59.6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 32.1 |
---|---|
5-th percentile | 35.7 |
Q1 | 39.225 |
median | 44.45 |
Q3 | 48.5 |
95-th percentile | 51.995 |
Maximum | 59.6 |
Range | 27.5 |
Interquartile range (IQR) | 9.275 |
Descriptive statistics
Standard deviation | 5.4595837 |
---|---|
Coefficient of variation (CV) | 0.124302 |
Kurtosis | -0.87602697 |
Mean | 43.92193 |
Median Absolute Deviation (MAD) | 4.75 |
Skewness | 0.053118067 |
Sum | 15021.3 |
Variance | 29.807054 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41.1 | 7 | 2.0% |
45.2 | 6 | 1.7% |
46.5 | 5 | 1.5% |
46.2 | 5 | 1.5% |
50.5 | 5 | 1.5% |
39.6 | 5 | 1.5% |
50 | 5 | 1.5% |
45.5 | 5 | 1.5% |
37.8 | 5 | 1.5% |
47.5 | 4 | 1.2% |
Other values (154) | 290 |
Value | Count | Frequency (%) |
32.1 | 1 | |
33.1 | 1 | |
33.5 | 1 | |
34 | 1 | |
34.1 | 1 | |
34.4 | 1 | |
34.5 | 1 | |
34.6 | 2 | |
35 | 2 | |
35.1 | 1 |
Value | Count | Frequency (%) |
59.6 | 1 | |
58 | 1 | |
55.9 | 1 | |
55.8 | 1 | |
55.1 | 1 | |
54.3 | 1 | |
54.2 | 1 | |
53.5 | 1 | |
53.4 | 1 | |
52.8 | 1 |
bill_depth_mm
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 80 |
---|---|
Distinct (%) | 23.4% |
Missing | 2 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.15117 |
Minimum | 13.1 |
---|---|
Maximum | 21.5 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 13.1 |
---|---|
5-th percentile | 13.9 |
Q1 | 15.6 |
median | 17.3 |
Q3 | 18.7 |
95-th percentile | 20 |
Maximum | 21.5 |
Range | 8.4 |
Interquartile range (IQR) | 3.1 |
Descriptive statistics
Standard deviation | 1.9747932 |
---|---|
Coefficient of variation (CV) | 0.11514044 |
Kurtosis | -0.90686609 |
Mean | 17.15117 |
Median Absolute Deviation (MAD) | 1.5 |
Skewness | -0.14346463 |
Sum | 5865.7 |
Variance | 3.899808 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17 | 12 | 3.5% |
15 | 10 | 2.9% |
18.6 | 10 | 2.9% |
17.9 | 10 | 2.9% |
18.5 | 10 | 2.9% |
17.3 | 9 | 2.6% |
18.9 | 9 | 2.6% |
19 | 9 | 2.6% |
17.8 | 9 | 2.6% |
18.1 | 9 | 2.6% |
Other values (70) | 245 |
Value | Count | Frequency (%) |
13.1 | 1 | 0.3% |
13.2 | 1 | 0.3% |
13.3 | 1 | 0.3% |
13.4 | 1 | 0.3% |
13.5 | 2 | 0.6% |
13.6 | 1 | 0.3% |
13.7 | 6 | |
13.8 | 4 | |
13.9 | 4 | |
14 | 2 | 0.6% |
Value | Count | Frequency (%) |
21.5 | 1 | 0.3% |
21.2 | 2 | |
21.1 | 3 | |
20.8 | 1 | 0.3% |
20.7 | 3 | |
20.6 | 1 | 0.3% |
20.5 | 1 | 0.3% |
20.3 | 3 | |
20.2 | 1 | 0.3% |
20.1 | 1 | 0.3% |
flipper_length_mm
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 55 |
---|---|
Distinct (%) | 16.1% |
Missing | 2 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200.9152 |
Minimum | 172 |
---|---|
Maximum | 231 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 172 |
---|---|
5-th percentile | 181 |
Q1 | 190 |
median | 197 |
Q3 | 213 |
95-th percentile | 225 |
Maximum | 231 |
Range | 59 |
Interquartile range (IQR) | 23 |
Descriptive statistics
Standard deviation | 14.061714 |
---|---|
Coefficient of variation (CV) | 0.0699883 |
Kurtosis | -0.98427289 |
Mean | 200.9152 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 0.34568183 |
Sum | 68713 |
Variance | 197.73179 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
190 | 22 | 6.4% |
195 | 17 | 4.9% |
187 | 16 | 4.7% |
193 | 15 | 4.4% |
210 | 14 | 4.1% |
191 | 13 | 3.8% |
215 | 12 | 3.5% |
197 | 10 | 2.9% |
196 | 10 | 2.9% |
185 | 9 | 2.6% |
Other values (45) | 204 |
Value | Count | Frequency (%) |
172 | 1 | 0.3% |
174 | 1 | 0.3% |
176 | 1 | 0.3% |
178 | 4 | |
179 | 1 | 0.3% |
180 | 5 | |
181 | 7 | |
182 | 3 | |
183 | 2 | 0.6% |
184 | 7 |
Value | Count | Frequency (%) |
231 | 1 | 0.3% |
230 | 7 | |
229 | 2 | 0.6% |
228 | 4 | |
226 | 1 | 0.3% |
225 | 4 | |
224 | 3 | |
223 | 2 | 0.6% |
222 | 6 | |
221 | 5 |
body_mass_g
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 94 |
---|---|
Distinct (%) | 27.5% |
Missing | 2 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4201.7544 |
Minimum | 2700 |
---|---|
Maximum | 6300 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 2700 |
---|---|
5-th percentile | 3150 |
Q1 | 3550 |
median | 4050 |
Q3 | 4750 |
95-th percentile | 5650 |
Maximum | 6300 |
Range | 3600 |
Interquartile range (IQR) | 1200 |
Descriptive statistics
Standard deviation | 801.95454 |
---|---|
Coefficient of variation (CV) | 0.19086183 |
Kurtosis | -0.71922187 |
Mean | 4201.7544 |
Median Absolute Deviation (MAD) | 600 |
Skewness | 0.47032933 |
Sum | 1437000 |
Variance | 643131.08 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3800 | 12 | 3.5% |
3700 | 11 | 3.2% |
3900 | 10 | 2.9% |
3950 | 10 | 2.9% |
3550 | 9 | 2.6% |
4300 | 8 | 2.3% |
3400 | 8 | 2.3% |
4400 | 8 | 2.3% |
3450 | 8 | 2.3% |
3500 | 7 | 2.0% |
Other values (84) | 251 |
Value | Count | Frequency (%) |
2700 | 1 | 0.3% |
2850 | 2 | |
2900 | 4 | |
2925 | 1 | 0.3% |
2975 | 1 | 0.3% |
3000 | 2 | |
3050 | 4 | |
3075 | 1 | 0.3% |
3100 | 1 | 0.3% |
3150 | 4 |
Value | Count | Frequency (%) |
6300 | 1 | 0.3% |
6050 | 1 | 0.3% |
6000 | 2 | 0.6% |
5950 | 2 | 0.6% |
5850 | 3 | |
5800 | 2 | 0.6% |
5750 | 1 | 0.3% |
5700 | 5 | |
5650 | 3 | |
5600 | 2 | 0.6% |
sex
Categorical
HIGH CORRELATION
  MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 0.6% |
Missing | 11 |
Missing (%) | 3.2% |
Memory size | 2.8 KiB |
male | |
---|---|
female |
Common Values
Value | Count | Frequency (%) |
male | 168 | |
female | 165 | |
(Missing) | 11 | 3.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
male | 168 | |
female | 165 |
Most occurring characters
Value | Count | Frequency (%) |
e | 498 | |
m | 333 | |
a | 333 | |
l | 333 | |
f | 165 | 9.9% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1662 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 498 | |
m | 333 | |
a | 333 | |
l | 333 | |
f | 165 | 9.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1662 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 498 | |
m | 333 | |
a | 333 | |
l | 333 | |
f | 165 | 9.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1662 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 498 | |
m | 333 | |
a | 333 | |
l | 333 | |
f | 165 | 9.9% |
year
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 KiB |
2009 | |
---|---|
2008 | |
2007 |
Common Values
Value | Count | Frequency (%) |
2009 | 120 | |
2008 | 114 | |
2007 | 110 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2009 | 120 | |
2008 | 114 | |
2007 | 110 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 688 | |
2 | 344 | |
9 | 120 | 8.7% |
8 | 114 | 8.3% |
7 | 110 | 8.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1376 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 688 | |
2 | 344 | |
9 | 120 | 8.7% |
8 | 114 | 8.3% |
7 | 110 | 8.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1376 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 688 | |
2 | 344 | |
9 | 120 | 8.7% |
8 | 114 | 8.3% |
7 | 110 | 8.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1376 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 688 | |
2 | 344 | |
9 | 120 | 8.7% |
8 | 114 | 8.3% |
7 | 110 | 8.0% |
rowid | bill_length_mm | bill_depth_mm | flipper_length_mm | body_mass_g | species | island | sex | year | |
---|---|---|---|---|---|---|---|---|---|
rowid | 1.000 | 0.775 | -0.253 | 0.483 | 0.275 | 0.953 | 0.687 | 0.000 | 0.745 |
bill_length_mm | 0.775 | 1.000 | -0.222 | 0.673 | 0.584 | 0.650 | 0.324 | 0.520 | 0.117 |
bill_depth_mm | -0.253 | -0.222 | 1.000 | -0.523 | -0.432 | 0.635 | 0.484 | 0.586 | 0.107 |
flipper_length_mm | 0.483 | 0.673 | -0.523 | 1.000 | 0.840 | 0.701 | 0.501 | 0.448 | 0.219 |
body_mass_g | 0.275 | 0.584 | -0.432 | 0.840 | 1.000 | 0.605 | 0.456 | 0.589 | 0.000 |
species | 0.953 | 0.650 | 0.635 | 0.701 | 0.605 | 1.000 | 0.657 | 0.000 | 0.000 |
island | 0.687 | 0.324 | 0.484 | 0.501 | 0.456 | 0.657 | 1.000 | 0.000 | 0.058 |
sex | 0.000 | 0.520 | 0.586 | 0.448 | 0.589 | 0.000 | 0.000 | 1.000 | 0.000 |
year | 0.745 | 0.117 | 0.107 | 0.219 | 0.000 | 0.000 | 0.058 | 0.000 | 1.000 |
rowid | species | island | bill_length_mm | bill_depth_mm | flipper_length_mm | body_mass_g | sex | year | |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | Adelie | Torgersen | 39.1 | 18.7 | 181.0 | 3750.0 | male | 2007 |
1 | 2 | Adelie | Torgersen | 39.5 | 17.4 | 186.0 | 3800.0 | female | 2007 |
2 | 3 | Adelie | Torgersen | 40.3 | 18.0 | 195.0 | 3250.0 | female | 2007 |
3 | 4 | Adelie | Torgersen | NaN | NaN | NaN | NaN | NaN | 2007 |
4 | 5 | Adelie | Torgersen | 36.7 | 19.3 | 193.0 | 3450.0 | female | 2007 |
5 | 6 | Adelie | Torgersen | 39.3 | 20.6 | 190.0 | 3650.0 | male | 2007 |
6 | 7 | Adelie | Torgersen | 38.9 | 17.8 | 181.0 | 3625.0 | female | 2007 |
7 | 8 | Adelie | Torgersen | 39.2 | 19.6 | 195.0 | 4675.0 | male | 2007 |
8 | 9 | Adelie | Torgersen | 34.1 | 18.1 | 193.0 | 3475.0 | NaN | 2007 |
9 | 10 | Adelie | Torgersen | 42.0 | 20.2 | 190.0 | 4250.0 | NaN | 2007 |
rowid | species | island | bill_length_mm | bill_depth_mm | flipper_length_mm | body_mass_g | sex | year | |
---|---|---|---|---|---|---|---|---|---|
334 | 335 | Chinstrap | Dream | 50.2 | 18.8 | 202.0 | 3800.0 | male | 2009 |
335 | 336 | Chinstrap | Dream | 45.6 | 19.4 | 194.0 | 3525.0 | female | 2009 |
336 | 337 | Chinstrap | Dream | 51.9 | 19.5 | 206.0 | 3950.0 | male | 2009 |
337 | 338 | Chinstrap | Dream | 46.8 | 16.5 | 189.0 | 3650.0 | female | 2009 |
338 | 339 | Chinstrap | Dream | 45.7 | 17.0 | 195.0 | 3650.0 | female | 2009 |
339 | 340 | Chinstrap | Dream | 55.8 | 19.8 | 207.0 | 4000.0 | male | 2009 |
340 | 341 | Chinstrap | Dream | 43.5 | 18.1 | 202.0 | 3400.0 | female | 2009 |
341 | 342 | Chinstrap | Dream | 49.6 | 18.2 | 193.0 | 3775.0 | male | 2009 |
342 | 343 | Chinstrap | Dream | 50.8 | 19.0 | 210.0 | 4100.0 | male | 2009 |
343 | 344 | Chinstrap | Dream | 50.2 | 18.7 | 198.0 | 3775.0 | female | 2009 |