Papers
Topics
Authors
Recent
Search
2000 character limit reached

A Novel Method for Clustering Cellular Data to Improve Classification

Published 5 Mar 2024 in q-bio.QM | (2403.03318v1)

Abstract: Many fields, such as neuroscience, are experiencing the vast proliferation of cellular data, underscoring the need for organizing and interpreting large datasets. A popular approach partitions data into manageable subsets via hierarchical clustering, but objective methods to determine the appropriate classification granularity are missing. We recently introduced a technique to systematically identify when to stop subdividing clusters based on the fundamental principle that cells must differ more between than within clusters. Here we present the corresponding protocol to classify cellular datasets by combining data-driven unsupervised hierarchical clustering with statistical testing. These general-purpose functions are applicable to any cellular dataset that can be organized as two-dimensional matrices of numerical values, including molecular, physiological, and anatomical datasets. We demonstrate the protocol using cellular data from the Janelia MouseLight project to characterize morphological aspects of neurons.

Summary

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.