<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    abb
   </journal-id>
   <journal-title-group>
    <journal-title>
     Advances in Bioscience and Biotechnology
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2156-8456
   </issn>
   <issn publication-format="print">
    2156-8502
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/abb.2025.1610029
   </article-id>
   <article-id pub-id-type="publisher-id">
    abb-146521
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Biomedical 
     </subject>
     <subject>
       Life Sciences
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    Integrative Cell Bin Segmentation on Spatial Transcriptomics by Voronoi
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Ming
      </surname>
      <given-names>
       Lin
      </given-names>
     </name>
    </contrib>
   </contrib-group> 
   <aff id="affnull">
    <addr-line>
     aDepartment of Mathematical Sciences, University of Nottingham, Nottingham, UK
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     16
    </day> 
    <month>
     10
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    16
   </volume> 
   <issue>
    10
   </issue>
   <fpage>
    446
   </fpage>
   <lpage>
    461
   </lpage>
   <history>
    <date date-type="received">
     <day>
      11,
     </day>
     <month>
      September
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      19,
     </day>
     <month>
      September
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      19,
     </day>
     <month>
      October
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    Spatial transcriptomics is undergoing rapid advancements and iterations. It is a beneficial tool to significantly enhance our understanding of tissue organization and relationships between cells. Recent technological advancements have achieved subcellular resolution, providing much denser spot placement for downstream analysis. A key challenge for this following analysis is accurate cell segmentation and the assignment of spots to individual cells. The primary objective of this study was to evaluate the effectiveness of a new cell segmentation approach based on subcellular level spatial transcriptomic data by confirming nuclei positions and using Voronoi diagrams, compared to direct clustering with cellbin data. Our findings demonstrate that the Voronoi method not only outperforms traditional methods in providing clearer boundaries and better separation of cell types, but also excels in preserving the most transcripts, addressing the issue of low capture efficiency. This integrative methodology presents a substantial advancement in spatial transcriptomics, offering improved cell type classification and spatial pattern recognition. 
   </abstract>
   <kwd-group> 
    <kwd>
     Spatial Transcriptomics
    </kwd> 
    <kwd>
      Voronoi
    </kwd> 
    <kwd>
      Bioinformatics
    </kwd> 
    <kwd>
      High Resolution
    </kwd> 
    <kwd>
      Cell Segmentation
    </kwd> 
    <kwd>
      Clustering
    </kwd> 
    <kwd>
      Tissue Integration
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction</title>
   <p>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>Spatial transcriptomics can significantly enhance our comprehension of tissue arrangement and intercellular communications <xref ref-type="bibr" rid="scirp.146521-1">
     [1]
    </xref> <xref ref-type="bibr" rid="scirp.146521-2">
     [2]
    </xref>, preserving spatial information lost in traditional single-cell RNA sequencing (scRNA-seq) <xref ref-type="bibr" rid="scirp.146521-3">
     [3]
    </xref>. This information is significant for analyzing cancer <xref ref-type="bibr" rid="scirp.146521-4">
     [4]
    </xref>, finding relationships between different genes <xref ref-type="bibr" rid="scirp.146521-5">
     [5]
    </xref>, and discovering drugs <xref ref-type="bibr" rid="scirp.146521-6">
     [6
    </xref><xref ref-type="bibr" rid="scirp.146521-6">
     ]
    </xref>. In current biomedical research, spatial transcriptomic technologies can be generally divided into two categories: imaging-based technologies (IST) and sequencing-based technologies (SST) <xref ref-type="bibr" rid="scirp.146521-7">
     [7]
    </xref>. Imaging-based methods, such as MERFISH <xref ref-type="bibr" rid="scirp.146521-8">
     [8]
    </xref> and seqFISH+ <xref ref-type="bibr" rid="scirp.146521-9">
     [9]
    </xref>, can obtain high-resolution gene expression information but are limited in their ability to detect all gene types. In contrast, sequencing-based methods, which benefit from their efficiency and high throughput, originally faced the challenge of low resolution. These initial SST methods, such as Visium <xref ref-type="bibr" rid="scirp.146521-10">
     [10]
    </xref>, only have 55 μm (center-to-center) capture areas, capturing gene expression from multiple cells within each spatial area, making it difficult to discern single-cell level details <xref ref-type="bibr" rid="scirp.146521-11">
     [11]
    </xref>.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>Recent advancements have addressed some of these limitations. For instance, the development of Slide-seqV2 <xref ref-type="bibr" rid="scirp.146521-11">
     [11]
    </xref> has improved the spatial resolution of SST to 2 μm, enabling near single-cell resolution by capturing transcripts with higher efficiency and spatial precision. The advent of the Stereo-seq (Spatial Enhanced Resolution Omics Sequencing) method from BGI Spatial has further enhanced the capture density to 0.5 μm (center-to-center). Stereo-seq integrates high-resolution spatial barcoding with next-generation sequencing, enabling the capture of gene expression at subcellular resolution <xref ref-type="bibr" rid="scirp.146521-12">
     [12]
    </xref>, demonstrating the highest capturing ability among the current SST method <xref ref-type="bibr" rid="scirp.146521-13">
     [13]
    </xref>. This improvement allows for precise mapping and analysis of gene expression within individual cells, significantly improving our ability to study the intricate spatial dynamics of tissues and facilitating a deeper understanding of cellular functions. <xref ref-type="bibr" rid="scirp.146521-14">
     [14]
    </xref></p>
   <p>Despite these advancements, critical analysis reveals that current methods still have room for improvement <xref ref-type="bibr" rid="scirp.146521-13">
     [13]
    </xref>. IST methods, while offering high resolution, require complex and time-consuming procedures. SST methods, despite enhancements in resolution, still struggle with the comprehensive capture of spatial context at a single-cell level <xref ref-type="bibr" rid="scirp.146521-15">
     [15]
    </xref>. Therefore, there is a continuous need for innovative approaches that combine the strengths of both IST and SST methods to achieve more precise and comprehensive spatial transcriptomic analyses <xref ref-type="bibr" rid="scirp.146521-16">
     [16]
    </xref>.</p>
   <p>When we obtain the gene expression information by the above methods, accurate cell segmentation is essential before we ultimately cluster the cells together and perform downstream analyses <xref ref-type="bibr" rid="scirp.146521-17">
     [17]
    </xref>. The challenge of attaining precise, automated cell segmentation primarily stems from variations in cell morphology, dimensions, and distribution within different tissue types <xref ref-type="bibr" rid="scirp.146521-18">
     [18]
    </xref>.</p>
   <p>Conventional imagee-based segmentation methods in sequencing-based technologies are constrained and fail to fully harness the information provided by spatial transcriptomics profiling, since they only record the area of the nucleus instead of the whole cell <xref ref-type="bibr" rid="scirp.146521-18">
     [18]
    </xref>. Some original methods use the watershed algorithm <xref ref-type="bibr" rid="scirp.146521-19">
     [19]
    </xref> to find the cell boundaries, while other recent methods design deep learning-based cell segmentation algorithms to handle complex tissue images, including TissueNet <xref ref-type="bibr" rid="scirp.146521-20">
     [20]
    </xref>, GeneSegNet <xref ref-type="bibr" rid="scirp.146521-21">
     [21]
    </xref>, Cellpose <xref ref-type="bibr" rid="scirp.146521-22">
     [22]
    </xref>, and SCS <xref ref-type="bibr" rid="scirp.146521-23">
     [23]
    </xref>. TissueNet is designed to accurately identify and segment cells in high-dimensional tissue images, leveraging a neural network trained on a diverse set of annotated images to generalize well across different tissue types and imaging modalities.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>For high-resolution spatial transcriptomics, SCS was designed by integrating sequencing and imaging data, utilizing a transformer neural network to adaptively learn the position of each spot relative to its cell center, finally enhancing cell segmentation and achieving greater accuracy compared to current methods. However, those methods still face some drawbacks due to the requirement of supervision <xref ref-type="bibr" rid="scirp.146521-24">
     [24]
    </xref>, low capture efficiency, and lengthy code runtimes <xref ref-type="bibr" rid="scirp.146521-25">
     [25]
    </xref>, which can be addressed in our method.</p>
   <p>In this study, we introduce the Voronoi method <xref ref-type="bibr" rid="scirp.146521-26">
     [26]
    </xref>, which is designed based on nuclei-based data and contains larger gene expression data. Built on the BGI method, our Voronoi method will optimize their cell segmentation method by utilizing Voronoi segmentation <xref ref-type="bibr" rid="scirp.146521-27">
     [27]
    </xref> combined with nuclei-based spatial data, providing more accurate cell type clustering and a larger dataset that preserves most transcripts for downstream analysis.</p>
   <p>This method utilizes spatial transcriptomics data, specifically cellbin.gef and tissue.gef, for cell segmentation and gene expression analysis. The cellbin.gef file contains gene expression data at the nuclear level, where each entry corresponds to a nucleus’s position and its associated gene expression profile. In contrast, the tissue.gef file contains gene expression data at the instrument-detected uniform spot level (e.g., DNA nanoballs, DNB), which are evenly distributed across the entire tissue slice and record spatial transcriptomics information.</p>
   <p>First, we utilized cellbin data to determine the exact location of nuclei and defined them as the centers of each cell region. Then, the Voronoi diagram was employed to delineate cell boundaries, assigning each nucleus to a unique region index. We assumed each region represents a cell. Gene information from tissue data was then mapped onto these segmented regions, obtaining a larger cellbin dataset consisting of fourfold numbers of gene expressions. Clustering and further downstream analysis were performed by using the Stereopy toolkit <xref ref-type="bibr" rid="scirp.146521-28">
     [28]
    </xref> and Mapmycells <xref ref-type="bibr" rid="scirp.146521-29">
     [29]
    </xref> to prove the dominant advantage of the Voronoi method.</p>
  </sec><sec id="s2">
   <title>2. Methods</title>
   <sec id="s2_1">
    <title>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>2.1. Data Source</title>
    <p>The data used in this study were obtained from Stereopy, a Python package developed by BGI Spatial for spatial transcriptomics analysis. The main dataset, the MouseBrain Demo, includes two GEF files: SS200000135TL_D1.cellbin.gef and SS200000135TL_D1.tissue.gef. GEF files provide spatial coordinates, gene expression data, and metadata crucial for understanding spatial gene expression. Cellbin data represent gene expression at the cell nucleus level, while tissue data include expression data for spots within the mouse brain tissue. The dataset can be downloaded from the following URL: <xref ref-type="bibr" rid="scirp.146521-http://upload.dcs.cloud:8090/share/bb6fab82-2c16-46b2-a95e-6931338f31bf">
      http://upload.dcs.cloud:8090/share/bb6fab82-2c16-46b2-a95e-6931338f31bf
     </xref>.</p>
    <p>The data used in this study were entirely obtained from publicly available resources. Specifically, we used the SS200000135TL_D1 dataset provided by BGI Research, which includes spatial transcriptomic data derived from mouse brain tissue. The dataset was generated and released independently by BGI, and is publicly accessible at: <xref ref-type="bibr" rid="scirp.146521-https://enfile.stomics.tech/SS200000135TL_D1.report.html">
      https://enfile.stomics.tech/SS200000135TL_D1.report.html
     </xref>. No new animal experiments or tissue collection were performed by the authors in the course of this study.</p>
   </sec>
   <sec id="s2_2">
    <title>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>2.2. Voronoi Segmentation Method</title>
    <p>The Voronoi tessellation was implemented using the scipy.spatial.Voronoi code in Python, which computes partitions based on Euclidean distances from the nuclei seed points. To ensure that all Voronoi regions were properly closed within the tissue boundary, we applied a boundary extension procedure by introducing pseudo-random points at the periphery of the convex hull. This approach allowed the algorithm to generate bounded polygons for previously unbounded cells, resulting in a complete and biologically realistic segmentation of the tissue section.</p>
   </sec>
   <sec id="s2_3">
    <title>2.3. BGI Data Processing Method</title>
    <p>The BGI process consists of Preprocessing, Embedding, and Clustering.</p>
    <p>Preprocessing is crucial for ensuring data quality and preparing it for subsequent analysis. The preprocessing steps are shown as follows:</p>
    <p>1) Data filtering: To ensure the accuracy and reliability of the analysis, we perform quality control: remove all missing values and outliers from the dataset, as well as cells having too many mitochondrial genes expressed, cells without enough genes expressed, and cells exceeding the count range. Here, we delete the cells whose number of genes that have non-zero counts is less than 3.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>2) Normalization <xref ref-type="bibr" rid="scirp.146521-30">
      [30]
     </xref>: Scaling the data to ensure that all features contribute equally to the analysis. This involves adjusting the values measured on different scales to a common scale. Methods for normalization are normalize_total <xref ref-type="bibr" rid="scirp.146521-31">
      [31]
     </xref> and log1p <xref ref-type="bibr" rid="scirp.146521-32">
      [32]
     </xref>.</p>
    <p>3) Filtering: Highly variable genes are then selected based on predefined criteria to focus the analysis on the most pertinent features. The steropy package provides preloaded tools to handle and preprocess such spatial datasets effectively.</p>
    <p>Embedding refers to transforming high-dimensional data into a lower-dimensional space to facilitate visualization and analysis.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>1) Dimensionality Reduction: Techniques such as PCA are applied to reduce the dimensionality of the data while preserving its intrinsic structure. Only highly variable genes are taken into consideration in this step. After that, we calculate the neighborhood graph <xref ref-type="bibr" rid="scirp.146521-33">
      [33]
     </xref> of cells and use the UMAP method <xref ref-type="bibr" rid="scirp.146521-34">
      [34]
     </xref> with the help of the PCA representation of the expression matrix.</p>
    <p>2) Visualization: The lower-dimensional embeddings are visualized to observe patterns and relationships within the data. UMAP is used to help in understanding the data’s underlying structure and distribution.</p>
    <p>Clustering is the process of grouping similar data points together based on their features:</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>1) Algorithm Selection: Leiden algorithm, which has been proven to fit spatial transcriptomics better than Louvain, is selected <xref ref-type="bibr" rid="scirp.146521-35">
      [35]
     </xref>.</p>
    <p>2) Cluster Assignment: Each data point was assigned to a cluster, and the resulting clusters were analyzed to understand their characteristics. Clusters can be visualized in scatter plots, and clustering effects can be evaluated by UMAP.</p>
   </sec>
   <sec id="s2_4">
    <title>2.4. Clustering Evaluation Method</title>
    <p>The Silhouette Score measures the cohesion and separation of clusters. It quantifies how similar each data point is to its own cluster compared to other clusters. The score ranges from −1 to 1, where a higher value indicates better-defined and more distinct clusters.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>The Silhouette Score for point 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        i 
      </mi> 
     </math> is defined as: 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         s 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          i 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mi>
           b 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            i 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mo>
           − 
         </mo> 
         <mi>
           a 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            i 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mrow> 
         <mtext>
           max 
         </mtext> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             a 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              i 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
           <mo>
             , 
           </mo> 
           <mi>
             b 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              i 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mfrac> 
      </mrow> 
     </math>, where 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         a 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          i 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the average intra-cluster distance, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         b 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          i 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the average nearest-cluster distance.</p>
    <p>The Calinski-Harabasz Index, also known as the Variance Ratio Criterion, evaluates the ratio of between-cluster variance to within-cluster variance. Higher values indicate better-defined clusters.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>Set 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mtext> 
       </mtext> 
      </mrow> 
     </math> as the number of clusters, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        n 
      </mi> 
     </math> as the total number of data points, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mi>
          B 
        </mi> 
       </msub> 
      </mrow> 
     </math> be the between-cluster dispersion matrix, and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mtext> 
       </mtext> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mi>
          w 
        </mi> 
       </msub> 
      </mrow> 
     </math> be the within-cluster dispersion matrix.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>The index is computed as: 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         C 
       </mi> 
       <mi>
         H 
       </mi> 
       <mtext> 
       </mtext> 
       <mi>
         I 
       </mi> 
       <mi>
         n 
       </mi> 
       <mi>
         d 
       </mi> 
       <mi>
         e 
       </mi> 
       <mi>
         x 
       </mi> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mtext> 
         </mtext> 
         <mrow> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mi>
             r 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                S 
              </mi> 
              <mi>
                B 
              </mi> 
             </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mo>
            / 
          </mo> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               k 
             </mi> 
             <mtext> 
             </mtext> 
             <mo>
               − 
             </mo> 
             <mn>
               1 
             </mn> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </mrow> 
        </mrow> 
        <mrow> 
         <mtext> 
         </mtext> 
         <mrow> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mi>
             r 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                S 
              </mi> 
              <mi>
                W 
              </mi> 
             </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mo>
            / 
          </mo> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               n 
             </mi> 
             <mtext> 
             </mtext> 
             <mo>
               − 
             </mo> 
             <mi>
               k 
             </mi> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </mrow> 
        </mrow> 
       </mfrac> 
       <mtext> 
       </mtext> 
      </mrow> 
     </math>.</p>
    <p>Here, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         t 
       </mi> 
       <mi>
         r 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mi>
            B 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the trace of the between-cluster dispersion matrix, and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         t 
       </mi> 
       <mi>
         r 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mtext>
            w 
          </mtext> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the trace of the within-cluster dispersion matrix.</p>
   </sec>
   <sec id="s2_5">
    <title>2.5. Single-Cell Data Analysis with MapMyCells</title>
    <p>Single-cell transcriptomic data were analyzed using MapMyCell <xref ref-type="bibr" rid="scirp.146521-29">
      [29]
     </xref>, a comprehensive software suite designed for the integration, visualization, and interpretation of single-cell RNA sequencing (scRNA-seq) data. The software supports the integration of multiple datasets and allows for the comparison of cell populations across different conditions or treatments. Key features include customizable visualization options, such as t-SNE and UMAP plots, which facilitate the identification of distinct cell types and states.</p>
   </sec>
  </sec><sec id="s3">
   <title>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>3. Results</title>
   <p>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>To validate the effectiveness of the Voronoi segmentation method, we first applied it to a small section of mouse brain tissue, consisting of 72 cells (300 × 300 µm). This initial test demonstrated the feasibility of our approach on a manageable scale. Encouraged by these results, we expanded the analysis to a 3000 × 3000 µm section of the mouse brain to assess the impact of integrating tissue data on a larger and more complex dataset. The Voronoi method continued to demonstrate robust performance, effectively delineating cell regions and integrating genetic information with greater precision than the original cellbin dataset.</p>
   <sec id="s3_1">
    <title>3.1. The Generation of Voronoi</title>
    <p>The generation of the Voronoi diagram was a pivotal step in our analysis. By using the spatial coordinates of nuclei from the cellbin data, we defined the centers of each cell region, which the Voronoi method then used to delineate boundaries (<xref ref-type="fig" rid="fig1(a)">
      Figure 1(a)
     </xref>). This approach ensured that each nucleus was assigned a unique region, thereby accurately representing individual cells. By using the Voronoi Segmentation method, we divided the mouse brain section into certain regions that have the same number as the number of nuclei (cells) based on the nuclear position, as shown in <xref ref-type="fig" rid="fig1(b)">
      Figure 1(b)
     </xref>. Cartesian coordinates of vertices and boundaries can be obtained so that each region has its own range presented in Cartesian coordinates and is assigned a unique region index (<xref ref-type="fig" rid="fig1(c)">
      Figure 1(c)
     </xref>).</p>
    <fig id="fig1" position="float">
     <label>Figure 1</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 1. Visualization of cell locations and Voronoi segmentation method. (a) The cell locations obtained from the cell.gef file are shown in this raster plot, illustrating a specific region for 72 nuclei. The x-axis ranges from 12,000 to 12,300, and the y-axis ranges from 10,000 to 10,300. Each blue dot represents an individual cell, which serves as the center point for the Voronoi segmentation method; (b) This diagram demonstrates the Voronoi segmentation applied to the cell locations. Each solid black line represents the boundary of a cell region, generated such that any point within a region is closer to its corresponding cell (center point) than to any other cell; (c) This visualization assigns unique region indices to each Voronoi cell. Different colors represent distinct regions, and each region is labeled with a specific index.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId39.jpeg?20251022112637" />
    </fig>
   </sec>
   <sec id="s3_2">
    <title>3.2. Integration of Tissue Data</title>
    <p>Recognizing that cellbin data primarily includes information about the locations of cell nuclei as detected during staining, it represents only a small fraction of the spatial landscape within the tissue slice. This limitation restricts the comprehensive analysis of gene expression across the entire tissue. In contrast, tissue data encompasses the entire tissue slice, providing a broader and more detailed spatial transcriptomic profile. To overcome the spatial limitations of cellbin data, we integrated tissue data with the cellbin nuclei locations. Allocated by the spatial coordinate, we map the denser tissue data onto the regions defined by the cellbin nuclei and Voronoi boundary and effectively quadruple the total number of assigned transcripts per cell region and gene expression information within each region, resulting in a more complete dataset that captures gene expression across the entire tissue slice.</p>
   </sec>
   <sec id="s3_3">
    <title>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>3.3. Using Stereopy to Process and Analyze the Dataset</title>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>After integrating tissue data into each region using the Voronoi method, we created a new, enriched dataset—referred to as the “Voronoi dataset.” This dataset offers a more comprehensive view of gene expression across the entire tissue section. To understand the extent of the improvements introduced by the Voronoi method, we compared this new dataset with the original cellbin dataset, which we’ll refer to as “Original dataset”, and refer to the “Original Method” as the way we derive “Original Dataset.”</p>
    <p>Both datasets maintain the same structural format: an information matrix where rows correspond to spatial positions and columns represent different gene types. To evaluate the impact of the Voronoi method, we used the BGI toolkit to process and analyze, aiming to objectively assess whether the Voronoi method enhances the clustering quality and spatial resolution of the resulting analysis.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>Our analysis began with a 3000 × 3000 μm section of the mouse brain, carefully selected to test the method’s effectiveness in a complex tissue environment. As illustrated in <xref ref-type="fig" rid="fig2">
      Figure 2
     </xref>, we compared the spatial distribution of clusters identified using the Leiden clustering method after PCA.</p>
    <fig id="fig2" position="float">
     <label>Figure 2</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 2. Comparison of Leiden clustering results using different cell segmentation methods in spatial transcriptomics (a) Leiden spatial distribution of the original method; (b) Leiden spatial distribution of the Voronoi method. Colors represent different clusters as identified by the Leiden algorithm. The scale bar represents 400.0 μm in (a) and (b).</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId40.jpeg?20251022112638" />
    </fig>
    <p>
     <xref ref-type="fig" rid="fig2(a)">
      Figure 2(a)
     </xref> displays the clustering results from the Original dataset, with each color corresponding to a specific cell type as indicated in the legend. The clusters, while distinguishable, show some overlap and blurred boundaries, indicating challenges in accurately defining cell types. This limitation is particularly evident in the center of the tissue, where different cell types are densely packed. In contrast, <xref ref-type="fig" rid="fig2(b)">
      Figure 2(b)
     </xref> shows the clustering results after applying the Voronoi method. These clusters are more refined and distinct, with clearer separation between different cell types.</p>
    <p>
     <xref ref-type="fig" rid="fig3(a)">
      Figure 3(a)
     </xref> and <xref ref-type="fig" rid="fig3(b)">
      Figure 3(b)
     </xref> further highlight the method’s impact on specific clusters. In the original dataset, clusters appeared dispersed with indistinct boundaries, complicating conclusions about spatial organization. However, the Voronoi method significantly improved the clarity and concentration of clusters, aligning well with known anatomical structures in the mouse brain, underscoring its accuracy in cell classification.</p>
    <fig id="fig3" position="float">
     <label>Figure 3</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 3. Comparison of Leiden clustering for specific clusters (a) Leiden spatial clustering of the original method for specific clusters 4 and 9, others are shown in grey; (b) Leiden spatial clustering of the Voronoi method for specific clusters 4 and 5, others are shown in grey. Red and blue dots represent two clusters identified by the Leiden algorithm in the same layer. The scale bar represents 400.0 μm in both (a) and (b).</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId41.jpeg?20251022112638" />
    </fig>
    <fig id="fig4" position="float">
     <label>Figure 4</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 4. Comparison of Umap spatial distribution using different methods (a) UMAP spatial distribution of the original method; (b) UMAP spatial distribution of the Voronoi method. Both panels display the UMAP embedding of the clustering results obtained using the original and Voronoi methods. Each point represents a cell, and the colors correspond to different clusters as identified by the Leiden algorithm.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId42.jpeg?20251022112639" />
    </fig>
    <p>Moving to a broader analysis, <xref ref-type="fig" rid="fig4">
      Figure 4
     </xref> compares UMAP embeddings from both methods. The Voronoi method consistently outperforms the Original method, producing more distinct and less overlapping clusters. This result indicates more accurate identification of cellular identities, which is critical for understanding the complex spatial dynamics within tissues.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>To quantify the improvements, we calculated the Silhouette Score and Calinski-Harabasz Index for both methods, where higher values in both indices denote more distinct and well-defined clusters. The Voronoi method achieved a significantly higher Silhouette Score (0.166) and Calinski-Harabasz Index (2474.823) compared to the Original method (0.113 and 1256.078, respectively). These enhancements confirm that the Voronoi method produces more coherent and distinct clusters, reflecting better spatial separation and cluster compactness.</p>
    <p>Moreover, the Voronoi method captured a higher total count of genes and identified more non-zero gene counts within each cell, as shown in <xref ref-type="table" rid="table1">
      Table 1
     </xref>. These comparisons suggest that the Voronoi method not only improves detection of gene expression diversity but also enhances spatial resolution and clustering results, offering a more detailed and precise view of the tissue’s molecular landscape.</p>
    <table-wrap id="table1">
     <label>
      <xref ref-type="table" rid="table1">
       Table 1
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Table 1. Comparison of dataset and clustering results between original and Voronoi methods.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td custom-top-td acenter" width="44.44%"><p style="text-align:center">Criteria</p></td> 
       <td class="custom-bottom-td custom-top-td acenter" width="51.13%"><p style="text-align:center">Original method</p></td> 
       <td class="custom-bottom-td custom-top-td acenter" width="30.81%"><p style="text-align:center">Voronoi method</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="44.44%"><p style="text-align:center">Total counts of genes</p></td> 
       <td class="custom-top-td acenter" width="51.13%"><p style="text-align:center">22,805,800</p></td> 
       <td class="custom-top-td acenter" width="30.81%"><p style="text-align:center">90,040,768</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="44.44%"><p style="text-align:center">Number of gene count</p></td> 
       <td class="acenter" width="51.13%"><p style="text-align:center">2,442,459</p></td> 
       <td class="acenter" width="30.81%"><p style="text-align:center">10,507,885</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="44.44%"><p style="text-align:center">Silhouette Score</p></td> 
       <td class="acenter" width="51.13%"><p style="text-align:center">0.113</p></td> 
       <td class="acenter" width="30.81%"><p style="text-align:center">0.166</p></td> 
      </tr> 
      <tr> 
       <td class="custom-bottom-td acenter" width="44.44%"><p style="text-align:center">Calinski-Harabasz Index</p></td> 
       <td class="custom-bottom-td acenter" width="51.13%"><p style="text-align:center">1256.078</p></td> 
       <td class="custom-bottom-td acenter" width="30.81%"><p style="text-align:center">2474.823</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>In conclusion, the Voronoi segmentation method demonstrates substantial improvements in cell type classification and spatial analysis compared to traditional approaches. It provides a more accurate and clearer representation of spatial layers in transcriptomic data, aligning better with known anatomical structures. This method not only accurately delineates cortical layers (e.g., L1 and L5) and the boundary between the striatum and neocortex, consistent with the anatomical references provided by the Allen Brain Atlas <xref ref-type="bibr" rid="scirp.146521-38">
      [38]
     </xref>, but also facilitates studying layer-specific cell-cell interactions, e.g., excitatory/inhibitory balance across layers or differential expression of signaling ligands, paving the way for deeper insights into cellular function and tissue organization.</p>
   </sec>
   <sec id="s3_4">
    <title>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>3.4. Mapmycells Method Analysis</title>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>To further evaluate the effectiveness of the Voronoi method in analyzing spatial transcriptomics data, we employed the MapMyCells <xref ref-type="bibr" rid="scirp.146521-29">
      [29]
     </xref> tool. This tool is designed to map scRNA-seq and spatial transcriptomics data to cell types with bootstrapping probability, enabling scientists to compare their transcriptomics and spatial data against Allen Institute’s datasets. We analyzed both the original and Voronoi-processed datasets to evaluate how well each method captured the spatial distribution of cell types across a full brain tissue section.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>Initially, a smaller slice (3000*3000 μm) was analyzed, but to better learn the distribution of cell types and capture more contextual information across the entire mouse brain (11,000 × 16,000 μm), the analysis was expanded to a full brain tissue section. The following <xref ref-type="fig" rid="fig5">
      Figure 5
     </xref> shows scatter plots representing entire mouse brain data processed by MapMyCells and categorized by cell type. Each plot visualizes the distribution of cell class within the brain slice, with distinct colors indicating various cell types as indicated by the class number on the right. This figure allows for a comparison of the clustering quality between the original method and the Voronoi method.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref></p>
    <fig id="fig5" position="float">
     <label>Figure 5</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 5. Comparison of mouse brain slice scatter plots after MapMyCells process (a) Scatter plot by original method. Each color represents a class of cell type, ranging from class 1 to class 34; (b) Scatter plot by Voronoi method. Each color represents a cell type class, ranging from class 1 to class 34. The X and Y axes represent spatial coordinates within the brain slice.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId43.jpeg?20251022112640" />
    </fig>
    <p>
     <xref ref-type="fig" rid="fig5(a)">
      Figure 5(a)
     </xref> shows the results from the Original dataset. Here, the scatter plot reveals some overlap between clusters, particularly in the central part, where different cell types are mixed. This overlap suggests that the Original method may struggle to clearly separate different cell types, leading to potential inaccuracies in cell type identification and spatial distribution analysis. In contrast, <xref ref-type="fig" rid="fig5(b)">
      Figure 5(b)
     </xref> shows the Voronoi method, with less overlap and more distinct clusters, indicating improved clustering performance. The Voronoi method’s enhanced separation between clusters allows for a clearer and more accurate spatial representation, which is critical for understanding cell interactions and functions within the tissue.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>Mouse brain’s cerebral cortex is divided into six layers, each serving distinct functions and composed of various cell subtypes based on their spatial positions <xref ref-type="bibr" rid="scirp.146521-39">
      [39]
     </xref>. To further evaluate clustering quality, we focused on Layer 2, which has a unique cellular composition. MapMyCells classified cells into specific subtypes, and the spatial distribution of these subtypes was compared against known anatomical references to assess dataset quality.</p>
    <p>
     <xref ref-type="fig" rid="fig6">
      Figure 6
     </xref> consists of two scatter plots illustrating the distribution of unique cell subtypes that belong to layer 2 with bootstrapping probability. The cell subtypes represented in the plots can be regarded as marker subtypes of layer 2 <xref ref-type="bibr" rid="scirp.146521-40">
      [40]
     </xref>.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref></p>
    <fig id="fig6" position="float">
     <label>Figure 6</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 6. Comparison of the spatial distribution of layer 2 in the entire slice by Mapmycells (a) Scatter plot of layer 2 by the original method; (b) Scatter plot of layer 2 by the Voronoi method. Each point represents a cell, with colors corresponding to different subtypes. The transparency of each point indicates confidence in cell subtype classification, with darker points signifying higher probabilities.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId44.jpeg?20251022112641" />
    </fig>
    <p>
     <xref ref-type="fig" rid="fig6">
      Figure 6
     </xref> illustrates the distribution of key marker subtypes for Layer 2 detected in both datasets. The Voronoi method (<xref ref-type="fig" rid="fig6(b)">
      Figure 6(b)
     </xref>) displays a denser and more defined boundary for Layer 2 compared to the Original method (<xref ref-type="fig" rid="fig6(a)">
      Figure 6(a)
     </xref>), which shows lighter and less distinct boundaries with significant noise. The improved clarity and boundary definition in the Voronoi dataset suggest higher confidence in subtype classification, reflected in the darker points indicating higher bootstrapping probabilities. The reduction of noise further supports the Voronoi method’s superior simulation of the ground truth distribution, reinforcing its effectiveness in accurately mapping cellular subtypes.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146521-"></xref>To further assess the sharp performance of the Voronoi method in comparison to the Original method, we conducted a detailed analysis on Layer 5 of the mouse brain. Instead of using the standard x-axis or y-axis for analysis since it cannot indicate the edge sharpness in the irregular layer pattern, we selected a custom axis because direct analysis along either axis fails to capture the true spatial distribution of cells, where the distance represents the depth of the cortex.</p>
    <fig id="fig7" position="float">
     <label>Figure 7</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146521-"></xref>Figure 7. Quantified Analysis of Layer 5 (a) Scatter plot of Original Method with custom axis: The distribution of cells in Layer 5 using the Original method, plotted along a custom axis (red line) represented by the equation y = −0.85x + 22,400; (b) Scatter plot of Voronoi Method with custom axis: The cell distribution in Layer 5 using the Voronoi method, with the same custom axis; (c) Gaussian fit of distances from the custom axis: A frequency plot showing the distribution of distances of cells from the custom axis, with Gaussian fitting curves for both methods; (d) Bar plot comparing R<sup>2</sup> and Chi-square values: Comparison of the R<sup>2</sup> values and Chi-square statistics for the Gaussian fits of the Original and Voronoi methods.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302225-rId45.jpeg?20251022112642" />
    </fig>
    <p>Using this custom axis, we measured the distribution of cells and evaluated the fit of the data to the custom line to quantify how well each method captured the non-linear spatial dynamics of cell distributions. <xref ref-type="fig" rid="fig7(a)">
      Figure 7(a)
     </xref> and <xref ref-type="fig" rid="fig7(b)">
      Figure 7(b)
     </xref> present scatter plots of the cells in Layer 5 using the Original and Voronoi methods, respectively, along with the custom line fitted to the data. This custom axis, shown in red, is represented by the equation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         y 
       </mi> 
       <mo>
         = 
       </mo> 
       <mo>
         − 
       </mo> 
       <mn>
         0.85 
       </mn> 
       <mtext>
         x 
       </mtext> 
       <mo>
         + 
       </mo> 
       <mn>
         22400 
       </mn> 
      </mrow> 
     </math>, and serves as a reference for analyzing the spatial organization of the cells relative to this axis.</p>
    <p>To quantify the differences between the two methods, we measured the distances of each cell from the custom line and performed a Gaussian fit on the distribution of these distances, as shown in <xref ref-type="fig" rid="fig7(c)">
      Figure 7(c)
     </xref>. The frequency plot illustrates the distribution of distances for both methods, with the Original method represented by the blue curve and the Voronoi method by the orange curve.</p>
    <p>The Gaussian fitting results reveal that the Voronoi method produces a narrower distribution, with a lower mean distance (1539.74) and a smaller standard deviation (1117.62), compared to the Original method (mean = 1961.87, std = 1562.78). This indicates that the Voronoi method aligns the cells more closely with the custom axis, further supporting its superior ability to capture the true spatial dynamics of the tissue.</p>
    <p>Finally, <xref ref-type="fig" rid="fig7(d)">
      Figure 7(d)
     </xref> presents a comparison of two key statistical metrics: the R<sup>2</sup> value and the Chi-square value, both of which assess the goodness-of-fit for the Gaussian distributions. The bar plots on the left side of the figure show that the Voronoi method achieves a significantly higher R<sup>2</sup> value (0.733) compared to the Original method (0.530), indicating a better fit to the custom line. On the right, the Chi-square test results also favor the Voronoi method, with a substantially lower Chi-square value (1748) compared to the Original method (29,822), further confirming the improved performance of the Voronoi method.</p>
   </sec>
  </sec><sec id="s4">
   <title>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>4. Conclusion</title>
   <p>
    <xref ref-type="bibr" rid="scirp.146521-"></xref>The comparison between the clustering capability from the original method and the Voronoi method proves that the Voronoi method significantly improves cell type clustering quality. The Voronoi method achieves better separation and clearer boundaries between cell types, as well as more compact and coherent clusters and sharper distributions. These improvements are both visually and quantitatively evident and support the use of the Voronoi method for more accurate and reliable cell type clustering in spatial transcriptomics data.</p>
  </sec><sec id="s5">
   <title>Note</title>
   <p>All codes in the paper are available at the GitHub repository: <xref ref-type="bibr" rid="scirp.146521-https://github.com/Charlottttttte/Cell_Segmentation_Voronoi">
     https://github.com/Charlottttttte/Cell_Segmentation_Voronoi
    </xref></p>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.146521-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hu, W., Zhang, Y., Mei, J. and Fang, X. (2023) Spatial Transcriptomics in Human Biomedical Research and Clinical Application. Current Medicine, 2, Article No. 6. &gt;https://doi.org/10.1007/s44194-023-00023-4 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Efremova, M., Vento-Tormo, M., Teichmann, S.A. and Vento-Tormo, R. (2020) Cellphonedb: Inferring Cell-Cell Communication from Combined Expression of Multi-Subunit Ligand–Receptor Complexes. Nature Protocols, 15, 1484-1506. &gt;https://doi.org/10.1038/s41596-020-0292-x 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Larsson, L., Frisén, J. and Lundeberg, J. (2021) Spatially Resolved Transcriptomics Adds a New Dimension to Genomics. Nature Methods, 18, 15-18. &gt;https://doi.org/10.1038/s41592-020-01038-7 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jin, Y., Zuo, Y., Li, G., Liu, W., Pan, Y., Fan, T., et al. (2024) Advances in Spatial Transcriptomics and Its Applications in Cancer Research. Molecular Cancer, 23, Article No. 129. &gt;https://doi.org/10.1186/s12943-024-02040-9 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hu, J., Li, X., Coleman, K., Schroeder, A., Ma, N., Irwin, D.J., et al. (2021) SpaGCN: Integrating Gene Expression, Spatial Location and Histology to Identify Spatial Domains and Spatially Variable Genes by Graph Convolutional Network. Nature Methods, 18, 1342-1351. &gt;https://doi.org/10.1038/s41592-021-01255-8 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Cao, J., Li, C., Cui, Z., Deng, S., Lei, T., Liu, W., et al. (2024) Spatial Transcriptomics: A Powerful Tool in Disease Understanding and Drug Discovery. Theranostics, 14, 2946-2968. &gt;https://doi.org/10.7150/thno.95908 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Williams, C.G., Lee, H.J., Asatsuma, T., Vento-Tormo, R. and Haque, A. (2022) An Introduction to Spatial Transcriptomics for Biomedical Research. Genome Medicine, 14, Article No. 68. &gt;https://doi.org/10.1186/s13073-022-01075-1 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Chen, K.H., Boettiger, A.N., Moffitt, J.R., Wang, S. and Zhuang, X. (2015) Spatially Resolved, Highly Multiplexed RNA Profiling in Single Cells. Science, 348, aaa6090. &gt;https://doi.org/10.1126/science.aaa6090 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Eng, C.L., Lawson, M., Zhu, Q., Dries, R., Koulena, N., Takei, Y., et al. (2019) Transcriptome-scale Super-Resolved Imaging in Tissues by RNA SeqFISH. Nature, 568, 235-239. &gt;https://doi.org/10.1038/s41586-019-1049-y 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ståhl, P.L., Salmén, F., Vickovic, S., Lundmark, A., Navarro, J.F., Magnusson, J., et al. (2016) Visualization and Analysis of Gene Expression in Tissue Sections by Spatial Transcriptomics. Science, 353, 78-82. &gt;https://doi.org/10.1126/science.aaf2403 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Chen, T., You, L., Hardillo, J.A.U. and Chien, M. (2023) Spatial Transcriptomic Technologies. Cells, 12, Article 2042. &gt;https://doi.org/10.3390/cells12162042 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Xia, K., Sun, H., Li, J., Li, J., Zhao, Y., Chen, L., et al. (2022) The Single-Cell Stereo-Seq Reveals Region-Specific Cell Subtypes and Transcriptome Profiling in Arabidopsis Leaves. Developmental Cell, 57, 1299-1310. &gt;https://doi.org/10.1016/j.devcel.2022.04.011 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref13">
    <label>13</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     You, Y., Fu, Y.T., Li, L.X., Zhang, Z.M., et al. (2024) Systematic Comparison of Sequencing-Based Spatial Transcriptomic Methods. Nature Methods, 21, 1743-1754. 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref14">
    <label>14</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Chen, A., Liao, S., Cheng, M.N., et al. (2022) Spatiotemporal Transcriptomic Atlas of Mouse Organogenesis Using DNA Nanoball-Patterned Arrays. Cell, 185, 1777-1792.
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref15">
    <label>15</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Li, Y. and Luo, Y. (2024) STdGCN: Spatial Transcriptomic Cell-Type Deconvolution Using Graph Convolutional Networks. Genome Biology, 25, Article No. 206. &gt;https://doi.org/10.1186/s13059-024-03353-0 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref16">
    <label>16</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Fang, S., Chen, B., Zhang, Y., Sun, H., Liu, L., Liu, S., et al. (2023) Computational Approaches and Challenges in Spatial Transcriptomics. Genomics, Proteomics&amp;Bioinformatics, 21, 24-47. &gt;https://doi.org/10.1016/j.gpb.2022.10.001 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref17">
    <label>17</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Zormpas, E., Queen, R., Comber, A. and Cockell, S.J. (2023) Mapping the Transcriptome: Realizing the Full Potential of Spatial Data Analysis. Cell, 186, 5677-5689. &gt;https://doi.org/10.1016/j.cell.2023.11.003 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref18">
    <label>18</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ma, J., Xie, R., Ayyadhury, S., Ge, C., Gupta, A., Gupta, R., et al. (2024) The Multimodality Cell Segmentation Challenge: Toward Universal Solutions. Nature Methods, 21, 1103-1113. &gt;https://doi.org/10.1038/s41592-024-02233-6 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref19">
    <label>19</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Gamarra, M., Zurek, E., Escalante, H.J., Hurtado, L. and San-Juan-Vergara, H. (2019) Split and Merge Watershed: A Two-Step Method for Cell Segmentation in Fluorescence Microscopy Images. Biomedical Signal Processing and Control, 53, Article 101575. &gt;https://doi.org/10.1016/j.bspc.2019.101575 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref20">
    <label>20</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Greenwald, N.F., Miller, G., Moen, E., Kong, A., Kagel, A., Dougherty, T., et al. (2022) Whole-cell Segmentation of Tissue Images with Human-Level Performance Using Large-Scale Data Annotation and Deep Learning. Nature Biotechnology, 40, 555-565. &gt;https://doi.org/10.1038/s41587-021-01094-0 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref21">
    <label>21</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Wang, Y.X., Wang, W.G., Liu, D.F., et al. (2023) GeneSegNet: A Deep Learning Framework for Cell Segmentation by Integrating Gene Expression and Imaging. Genome Biology, 24, Article No. 235. &gt;https://doi.org/10.1186/s13059-023-03054-0
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref22">
    <label>22</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Stringer, C., Wang, T., Michaelos, M., et al. (2020) Cellpose: A Generalist Algorithm for Cellular Segmentation. Nature Methods, 18, 100-106.
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref23">
    <label>23</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Chen, H., Li, D. and Bar-Joseph, Z. (2023) SCS: Cell Segmentation for High-Resolution Spatial Transcriptomics. Nature Methods, 20, 1237-1243. &gt;https://doi.org/10.1038/s41592-023-01939-3 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref24">
    <label>24</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Xue, S., Zhu, F., Wang, C. and Min, W. (2024) StEnTrans: Transformer-Based Deep Learning for Spatial Transcriptomics Enhancement. In: Lecture Notes in Computer Science, Springer, 63-75. &gt;https://doi.org/10.1007/978-981-97-5128-0_6 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref25">
    <label>25</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Fu, X.H., Lin, Y.X., Lin, D.M., et al. (2024) BIDCell: Biologically-Informed Self-Supervised Learning for Segmentation of Subcellular Spatial Transcriptomics Data. Nature Communications, 15, Article No. 509. 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref26">
    <label>26</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Aurenhammer, F. (1991) Voronoi Diagrams—A Survey of a Fundamental Geometric Data Structure. ACM Computing Surveys, 23, 345-405. &gt;https://doi.org/10.1145/116873.116880 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref27">
    <label>27</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Senechal, M., Okabe, A., Boots, B. and Sugihara, K. (1995) Spatial Tessellations: Concepts and Applications of Voronoi Diagrams. The College Mathematics Journal, 26, 79-81. &gt;https://doi.org/10.2307/2687299 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref28">
    <label>28</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Fang, S.S., Xu, M.Y., Cao, L., et al. (2023) Stereopy: Modeling Comparative and Spatiotemporal Cellular Heterogeneity via Multi-Sample Spatial Transcriptomics. Nature Communications, 16, Article No. 3741.
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref29">
    <label>29</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Yao, Z., van Velthoven, C.T.J., Kunst, M., Zhang, M., McMillen, D., Lee, C., et al. (2023) A High-Resolution Transcriptomic and Spatial Atlas of Cell Types in the Whole Mouse Brain. Nature, 624, 317-332. &gt;https://doi.org/10.1038/s41586-023-06812-z 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref30">
    <label>30</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hafemeister, C. and Satija, R. (2019) Normalization and Variance Stabilization of Single-Cell RNA-Seq Data Using Regularized Negative Binomial Regression. Genome Biology, 20, Article 296. 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref31">
    <label>31</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Satija, R., Farrell, J.A., Gennert, D., Schier, A.F. and Regev, A. (2015) Spatial Reconstruction of Single-Cell Gene Expression Data. Nature Biotechnology, 33, 495-502. &gt;https://doi.org/10.1038/nbt.3192
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref32">
    <label>32</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Booeshaghi, A.S. and Pachter, L. (2020) Normalization of Single-Cell RNA-Seq Counts by Log(x+1)* or log(1+x)*. Bioinformatics, 37, 2223-2224. 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref33">
    <label>33</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Marshall, J.L., Noel, T., Wang, Q.S., Chen, H., Murray, E., Subramanian, A., et al. (2022) High-Resolution Slide-Seqv2 Spatial Transcriptomics Enables Discovery of Disease-Specific Cell Neighborhoods and Pathways. iScience, 25, Article 104097. &gt;https://doi.org/10.1016/j.isci.2022.104097 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref34">
    <label>34</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     McInnes, L., Healy, J., Saul, N. and Großberger, L. (2018) UMAP: Uniform Manifold Approximation and Projection. Journal of Open Source Software, 3, Article 861. &gt;https://doi.org/10.21105/joss.00861 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref35">
    <label>35</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Traag, V.A., Waltman, L. and van Eck, N.J. (2019) From Louvain to Leiden: Guaranteeing Well-Connected Communities. Scientific Reports, 9, Article No. 5233. &gt;https://doi.org/10.1038/s41598-019-41695-z 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref36">
    <label>36</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Rousseeuw, P.J. (1987) Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. Journal of Computational and Applied Mathematics, 20, 53-65. &gt;https://doi.org/10.1016/0377-0427(87)90125-7 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref37">
    <label>37</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Wang, X. and Xu, Y.S. (2019) An Improved Index for Clustering Validation Based on Silhouette Index and Calinski-Harabasz Index. IOP Conference Series: Materials Science and Engineering, 569, Article 052024. &gt;https://doi.org/10.1088/1757-899x/569/5/052024 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref38">
    <label>38</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Wang, Q., Ding, S., Li, Y., Royall, J., Feng, D., Lesnar, P., et al. (2020) The Allen Mouse Brain Common Coordinate Framework: A 3D Reference Atlas. Cell, 181, 936-953.e20. &gt;https://doi.org/10.1016/j.cell.2020.04.007 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref39">
    <label>39</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     BICCN: The First Complete Cell Census and Atlas of a Mammalian Brain. &gt;https://www.nature.com/immersive/d42859-023-00069-2/index.html 
    </mixed-citation>
   </ref>
   <ref id="scirp.146521-ref40">
    <label>40</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Harris, K.D. and Shepherd, G.M.G. (2015) The Neocortical Circuit: Themes and Variations. Nature Neuroscience, 18, 170-181. &gt;https://doi.org/10.1038/nn.3917
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>