Human-aligned Quantification of Numerical Data
November 15, 2025 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Anton Kolonin
arXiv ID
2511.15723
Category
physics.data-an
Cross-listed
cs.HC,
cs.LG,
math.NA
Citations
0
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Quantifying numerical data involves addressing two key challenges: first, determining whether the data can be naturally quantified, and second, identifying the numerical intervals or ranges of values that correspond to specific value classes, referred to as "quantums," which represent statistically meaningful states. If such quantification is feasible, continuous streams of numerical data can be transformed into sequences of "symbols" that reflect the states of the system described by the measured parameter. People often perform this task intuitively, relying on common sense or practical experience, while information theory and computer science offer computable metrics for this purpose. In this study, we assess the applicability of metrics based on information compression and the Silhouette coefficient for quantifying numerical data. We also investigate the extent to which these metrics correlate with one another and with what is commonly referred to as "human intuition." Our findings suggest that the ability to classify numeric data values into distinct categories is associated with a Silhouette coefficient above 0.65 and a Dip Test below 0.5; otherwise, the data can be treated as following a unimodal normal distribution. Furthermore, when quantification is possible, the Silhouette coefficient appears to align more closely with human intuition than the "normalized centroid distance" method derived from information compression perspective.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.data-an
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
A deep convolutional neural network approach to single-particle recognition in cryo-electron microscopy
R.I.P.
π»
Ghosted
The Pandora Software Development Kit for Pattern Recognition
R.I.P.
π»
Ghosted
Emergence of Compositional Representations in Restricted Boltzmann Machines
R.I.P.
π»
Ghosted
Investigating echo state networks dynamics by means of recurrence analysis
R.I.P.
π»
Ghosted
Discovering state-parameter mappings in subsurface models using generative adversarial networks
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted