patents.google.com

CN110110807B - Leukocyte extraction and classification method based on improved K-means and convolutional neural network - Google Patents

  • ️Tue Jan 03 2023
Leukocyte extraction and classification method based on improved K-means and convolutional neural network Download PDF

Info

Publication number
CN110110807B
CN110110807B CN201910404623.8A CN201910404623A CN110110807B CN 110110807 B CN110110807 B CN 110110807B CN 201910404623 A CN201910404623 A CN 201910404623A CN 110110807 B CN110110807 B CN 110110807B Authority
CN
China
Prior art keywords
white blood
improved
algorithm
neural network
convolutional neural
Prior art date
2019-05-16
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201910404623.8A
Other languages
Chinese (zh)
Other versions
CN110110807A (en
Inventor
林丽群
陈柏林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-05-16
Filing date
2019-05-16
Publication date
2023-01-03
2019-05-16 Application filed by Fuzhou University filed Critical Fuzhou University
2019-05-16 Priority to CN201910404623.8A priority Critical patent/CN110110807B/en
2019-08-09 Publication of CN110110807A publication Critical patent/CN110110807A/en
2023-01-03 Application granted granted Critical
2023-01-03 Publication of CN110110807B publication Critical patent/CN110110807B/en
Status Expired - Fee Related legal-status Critical Current
2039-05-16 Anticipated expiration legal-status Critical

Links

  • 210000000265 leukocyte Anatomy 0.000 title claims abstract description 69
  • 238000013527 convolutional neural network Methods 0.000 title claims abstract description 23
  • 238000000034 method Methods 0.000 title claims abstract description 20
  • 238000000605 extraction Methods 0.000 title claims abstract description 17
  • 238000004422 calculation algorithm Methods 0.000 claims abstract description 33
  • 230000011218 segmentation Effects 0.000 claims abstract description 23
  • 210000004027 cell Anatomy 0.000 claims abstract description 14
  • 210000000805 cytoplasm Anatomy 0.000 claims abstract description 12
  • 230000009286 beneficial effect Effects 0.000 claims abstract description 5
  • 230000006870 function Effects 0.000 claims description 15
  • 239000011159 matrix material Substances 0.000 claims description 15
  • 210000004940 nucleus Anatomy 0.000 claims description 10
  • 238000005070 sampling Methods 0.000 claims description 7
  • 210000003855 cell nucleus Anatomy 0.000 claims description 6
  • 230000001086 cytosolic effect Effects 0.000 claims description 5
  • 238000010186 staining Methods 0.000 claims description 4
  • 230000004913 activation Effects 0.000 claims description 3
  • 238000002474 experimental method Methods 0.000 claims description 3
  • 238000003064 k means clustering Methods 0.000 claims description 3
  • 210000002569 neuron Anatomy 0.000 claims description 3
  • 238000000926 separation method Methods 0.000 claims description 3
  • 239000006185 dispersion Substances 0.000 claims 1
  • 238000005259 measurement Methods 0.000 claims 1
  • 238000010606 normalization Methods 0.000 claims 1
  • 238000007635 classification algorithm Methods 0.000 abstract description 3
  • 238000000354 decomposition reaction Methods 0.000 abstract 1
  • 238000004458 analytical method Methods 0.000 description 2
  • 230000000694 effects Effects 0.000 description 2
  • 230000002159 abnormal effect Effects 0.000 description 1
  • 230000001464 adherent effect Effects 0.000 description 1
  • 210000004369 blood Anatomy 0.000 description 1
  • 239000008280 blood Substances 0.000 description 1
  • 238000009534 blood test Methods 0.000 description 1
  • 238000010276 construction Methods 0.000 description 1
  • 238000010586 diagram Methods 0.000 description 1
  • 239000003814 drug Substances 0.000 description 1
  • 210000003743 erythrocyte Anatomy 0.000 description 1
  • 238000000684 flow cytometry Methods 0.000 description 1
  • 238000003709 image segmentation Methods 0.000 description 1
  • 210000000987 immune system Anatomy 0.000 description 1
  • 239000012535 impurity Substances 0.000 description 1
  • 239000000203 mixture Substances 0.000 description 1
  • 230000000877 morphologic effect Effects 0.000 description 1

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/695Preprocessing, e.g. image segmentation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

本发明涉及一种基于改进K‑means及卷积神经网络的白细胞提取和分类方法。首先,根据细胞图像灰度分布来选定初始聚类中心,对图像所有像素按就近原则进行初始聚类;接着,对FWSA‑KM算法的欧式距离进行改进;白细胞提取之前,先进行颜色空间分解,采用有利于白细胞分割的颜色分量和改进的K‑means算法进行细胞核和细胞质的提取;接着采用分水岭算法来分离复杂粘连的部分;最后,基于卷积神经网络进行分类。本发明方法使得白细胞细胞核和细胞质分割精度分别为95.81%和91.28%,较传统分割方法有较大提高;分类准确度最大能达到98.96%,分类平均时间为0.39s,相对于现有的白细胞分类算法,CNN分类方法不仅有明显优势,同时还有很大的提升空间。

Figure 201910404623

The invention relates to a white blood cell extraction and classification method based on improved K-means and convolutional neural network. First, select the initial cluster center according to the gray distribution of the cell image, and perform initial clustering on all pixels of the image according to the principle of proximity; then, improve the Euclidean distance of the FWSA-KM algorithm; before white blood cell extraction, first perform color space decomposition , the color components that are beneficial to white blood cell segmentation and the improved K-means algorithm are used to extract the nucleus and cytoplasm; then the watershed algorithm is used to separate the parts of complex adhesions; finally, the classification is based on the convolutional neural network. The method of the present invention makes the segmentation accuracy of leukocyte nucleus and cytoplasm respectively 95.81% and 91.28%, which is greatly improved compared with the traditional segmentation method; the maximum classification accuracy can reach 98.96%, and the average classification time is 0.39s, compared with the existing white blood cell classification Algorithm, CNN classification method not only has obvious advantages, but also has a lot of room for improvement.

Figure 201910404623

Description

一种基于改进K-means及卷积神经网络的白细胞提取和分类 方法A white blood cell extraction and classification based on improved K-means and convolutional neural network method

技术领域technical field

本发明涉及医学图像分割提取技术领域,特别是涉及一种基于改进K-means及卷积神经网络的白细胞提取和分类方法。The invention relates to the technical field of medical image segmentation and extraction, in particular to a white blood cell extraction and classification method based on improved K-means and convolutional neural network.

背景技术Background technique

在医学上,白细胞是人体免疫系统重要的组成部分,负责识别并吞噬非正常细胞。血常规检查中白血细胞的传统分类计数和形态分析依赖于人工计数和血液检查的专家分析,效率低且具有较强主观性。目前常用的是流式细胞仪,也不能实现白细胞自动分类,并且在临床应用中具有局限性。In medicine, white blood cells are an important part of the human immune system, responsible for identifying and engulfing abnormal cells. The traditional differential counting and morphological analysis of white blood cells in blood routine examination rely on manual counting and expert analysis of blood tests, which is inefficient and highly subjective. Currently, flow cytometry is commonly used, which cannot automatically classify white blood cells, and has limitations in clinical application.

近年来,为了更好地分割图像和识别出白细胞,研究者相继提出了一些效果较好的算法来实现白细胞的精确分割和分类算法,但是白细胞分割仍然还存在着问题,这些问题主要来源于图像色彩亮度不一,图像中存在杂质,白细胞形状多种多样,染色后细胞质与红细胞的颜色相近。现有的方法所能实现的分割精度还不能达到临床实际需要,因此在白细胞分割领域还有许多工作需要进行。In recent years, in order to better segment images and identify white blood cells, researchers have successively proposed some algorithms with better effects to realize accurate segmentation and classification algorithms of white blood cells, but there are still problems in white blood cell segmentation, which mainly come from image The color brightness is different, there are impurities in the image, the shape of white blood cells is various, and the color of cytoplasm and red blood cells after staining is similar. The segmentation accuracy achieved by the existing methods can not meet the actual clinical needs, so there is still a lot of work to be done in the field of white blood cell segmentation.

发明内容Contents of the invention

本发明的目的在于提供一种基于改进K-means及卷积神经网络的白细胞提取和分类方法,该方法可以有效地提取白细胞并且分割精度较高,最后利用卷积神经网络(CNN)进行白细胞分类和识别。The purpose of the present invention is to provide a leukocyte extraction and classification method based on improved K-means and convolutional neural network, which can effectively extract leukocytes and have high segmentation accuracy, and finally utilize convolutional neural network (CNN) to carry out leukocyte classification and identification.

为实现上述目的,本发明的技术方案是:一种基于改进K-means及卷积神经网络的白细胞提取和分类方法,包括如下步骤:To achieve the above object, the technical solution of the present invention is: a leukocyte extraction and classification method based on improved K-means and convolutional neural network, comprising the following steps:

步骤S1、在提取白细胞之前,先进行颜色空间的分解,采用有利于白细胞分割的颜色分量;Step S1, before extracting white blood cells, decompose the color space first, and use color components that are beneficial to white blood cell segmentation;

步骤S2、根据细胞图像灰度分布对K-means聚类算法进行改进,选定初始聚类中心,使图像中所有像素按就近原则进行初始聚类,并对FWSA-KM算法的欧式距离进行改进,使聚类算法的鲁棒性提高;Step S2, improve the K-means clustering algorithm according to the gray distribution of the cell image, select the initial clustering center, make all pixels in the image perform initial clustering according to the nearest principle, and improve the Euclidean distance of the FWSA-KM algorithm , to improve the robustness of the clustering algorithm;

步骤S3、采用改进的K-means算法进行细胞核和细胞质的提取;Step S3, using the improved K-means algorithm to extract the nucleus and cytoplasm;

步骤S4、采用分水岭算法来分离复杂粘连的白细胞部分;Step S4, using the watershed algorithm to separate the white blood cells with complex adhesions;

步骤S5、采用卷积神经网络对提取分离的白细胞进行实验,实现粘连白细胞的识别。Step S5, using the convolutional neural network to conduct experiments on the extracted and separated leukocytes to realize the identification of the adhered leukocytes.

在本发明一实施例中,所述步骤S1具体实现如下:In an embodiment of the present invention, the step S1 is specifically implemented as follows:

步骤S11、建立彩色模型:对白细胞染色处理,使其在色调分量(H)空间和饱和分量(S)空间对应的细胞质区域以及白细胞区域都与背景图像存在着较强的对比度;Step S11, establishing a color model: staining the white blood cells so that the cytoplasmic regions and white blood cell regions corresponding to the hue component (H) space and the saturation component (S) space have a strong contrast with the background image;

步骤S12、在后续分割中,设置饱和分量空间和色调分量空间中阈值,将白细胞的细胞核区域以及白细胞区域粗略地从细胞图像中提取出来。Step S12 , in the subsequent segmentation, setting thresholds in the saturation component space and the hue component space, and roughly extracting the nucleus area and the white blood cell area of the white blood cell from the cell image.

在本发明一实施例中,所述步骤S2具体实现如下:In an embodiment of the present invention, the step S2 is specifically implemented as follows:

步骤S21、对细胞图像灰度进行直方图分布统计来选定初始聚类中心,使得图像所有像素按就近原则进行初始聚类;Step S21, performing histogram distribution statistics on the gray scale of the cell image to select the initial clustering center, so that all pixels of the image are initially clustered according to the principle of proximity;

步骤S22、基于非欧氏距离,改进特征权重,对K-means进行改进,得到Improved-KM聚类算法,即将该算法目标函数中的χik和vjk的欧氏距离|χik-vjk|修改为非欧氏距离

Figure BDA0002060817180000021

由此得到改进的Improved-KM聚类算法的目标函数为:

Figure BDA0002060817180000022

Step S22, based on the non-Euclidean distance, improve the feature weight, improve K-means, and obtain the Improved-KM clustering algorithm, that is, the Euclidean distance between χ ik and v jk in the algorithm's objective function | χ ik -v jk |Modified to non-Euclidean distance

Figure BDA0002060817180000021

The objective function of the improved Improved-KM clustering algorithm is:

Figure BDA0002060817180000022

在本发明一实施例中,所述步骤S22中,Improved-KM聚类算法步骤如下:In an embodiment of the present invention, in the step S22, the steps of the Improved-KM clustering algorithm are as follows:

步骤S221、目标函数:

Figure BDA0002060817180000023

其中U=(uij)n×c是隶属度矩阵;如果第i个数据点xi属于第j个类,则uij=1,否则uij=0,并且

Figure BDA0002060817180000024

而V=[v1,v2,…,vc]是c个聚类中心构成的矩阵;同时式子满足:Step S221, objective function:

Figure BDA0002060817180000023

Where U=(u ij ) n×c is the membership degree matrix; if the i-th data point x i belongs to the j-th class, then u ij =1, otherwise u ij =0, and

Figure BDA0002060817180000024

And V=[v 1 ,v 2 ,…,v c ] is a matrix composed of c cluster centers; at the same time, the formula satisfies:

Figure BDA0002060817180000025

Figure BDA0002060817180000025

步骤S222、最优隶属度矩阵

Figure BDA0002060817180000026

和聚类中心矩阵

Figure BDA0002060817180000027

中的元素为:

Figure BDA0002060817180000028

Figure BDA0002060817180000029

Step S222, optimal membership degree matrix

Figure BDA0002060817180000026

and cluster center matrix

Figure BDA0002060817180000027

The elements in are:

Figure BDA0002060817180000028

and

Figure BDA0002060817180000029

步骤S223、通过迭代求解三个最小化问题;Step S223, solving three minimization problems through iteration;

步骤S224:令

Figure BDA00020608171800000210

Figure BDA00020608171800000211

则ak度量了聚类在第k维特征上总的类内紧致性,bk度量了聚类在第k维特征上总的类间分离性度量;Step S224: make

Figure BDA00020608171800000210

and

Figure BDA00020608171800000211

Then a k measures the total intra-class compactness of the clustering on the k-th dimension feature, and b k measures the total inter-class separation measure of the clustering on the k-th dimension feature;

步骤S225、用新的目标函数

Figure BDA0002060817180000031

求解如下特征权重矩阵:Step S225, using a new objective function

Figure BDA0002060817180000031

Solve the following feature weight matrix:

Figure BDA0002060817180000032

Figure BDA0002060817180000032

其中满足:

Figure BDA0002060817180000033

Which satisfies:

Figure BDA0002060817180000033

步骤S226、设

Figure BDA0002060817180000034

为第t步迭代的特征权重,则下式可表示为第t+1步的特征权重:

Figure BDA0002060817180000035

其中特征权重调节差量如下:Step S226, set

Figure BDA0002060817180000034

is the feature weight of the t-th step iteration, then the following formula can be expressed as the feature weight of the t+1-th step:

Figure BDA0002060817180000035

The feature weight adjustment difference is as follows:

Figure BDA0002060817180000036

Figure BDA0002060817180000036

为了使

Figure BDA0002060817180000037

满足约束条件

Figure BDA0002060817180000038

对特征权重公式进行规范化处理,得到特征权重

Figure BDA0002060817180000039

because

Figure BDA0002060817180000037

meet the constraints

Figure BDA0002060817180000038

Normalize the feature weight formula to get the feature weight

Figure BDA0002060817180000039

在本发明一实施例中,所述步骤S3具体实现如下:In an embodiment of the present invention, the step S3 is specifically implemented as follows:

步骤S31、白细胞的提取:通过观察不同细胞图像的不同色彩模型的不同分量来选择,再进行聚类分割;Step S31, extraction of white blood cells: select by observing different components of different color models of different cell images, and then perform clustering and segmentation;

步骤S32、细胞核的提取:利用聚类后的二值图像含有大量的噪声且噪声普遍面积小的特点计算出每个连通区域的面积,剔除小于阈值的连通区域面积,填充在细胞核区域之中小的孔洞;Step S32, cell nucleus extraction: calculate the area of each connected region by using the characteristic that the clustered binary image contains a lot of noise and the noise generally has a small area, remove the connected region area smaller than the threshold, and fill in the small ones in the nucleus region holes;

步骤S33、细胞质的提取:将提取出的白细胞减去白细胞核的方法来得到细胞质部分,再进行重新恢复彩色像。Step S33, cytoplasmic extraction: subtracting the white blood cell nucleus from the extracted white blood cells to obtain the cytoplasmic part, and then restore the color image.

在本发明一实施例中,所述步骤S4具体实现如下:In an embodiment of the present invention, the step S4 is specifically implemented as follows:

步骤S41、对于有粘连在一起的白细胞,进行去噪和孔洞填充处理;Step S41, performing denoising and hole filling processing for the white blood cells that have adhered together;

步骤S42、用分水岭分割算法将粘连在一起的白细胞进行分割处理。Step S42, using the watershed segmentation algorithm to segment the cohesive white blood cells.

在本发明一实施例中,所述步骤S5中,所述卷积神经网络由输入层、卷积层、采样层、连接层和输出层构成;输入层输入需要分类的图像,由卷积层提取出相应的特征,为加速学习速度通过采样层下采样减少需要神经元个数同时保留有用信息,连接层通过激活函数将分类结果输入至输出层,输出层的维度为所需分类的类别数。In one embodiment of the present invention, in the step S5, the convolutional neural network is composed of an input layer, a convolutional layer, a sampling layer, a connection layer, and an output layer; the input layer inputs images to be classified, and the convolutional layer The corresponding features are extracted. In order to speed up the learning speed, the number of neurons required is reduced by downsampling in the sampling layer while retaining useful information. The connection layer inputs the classification results to the output layer through the activation function, and the dimension of the output layer is the number of categories to be classified. .

相较于现有技术,本发明具有以下有益效果:本发明方法使得白细胞细胞核和细胞质分割精度分别为95.81%和91.28%,较传统分割方法有较大提高;分类准确度最大能达到98.96%,分类平均时间为0.39s,相对于现有的白细胞分类算法,CNN分类方法不仅有明显优势,同时还有很大的提升空间Compared with the prior art, the present invention has the following beneficial effects: the method of the present invention makes the segmentation accuracy of leukocyte nucleus and cytoplasm respectively 95.81% and 91.28%, which is greatly improved compared with the traditional segmentation method; the maximum classification accuracy can reach 98.96%, The average classification time is 0.39s. Compared with the existing white blood cell classification algorithm, the CNN classification method not only has obvious advantages, but also has a lot of room for improvement

附图说明Description of drawings

图1为本发明方法流程图。Fig. 1 is a flow chart of the method of the present invention.

图2为本发明卷积神经网络的构建图。Fig. 2 is a construction diagram of the convolutional neural network of the present invention.

具体实施方式detailed description

下面结合附图,对本发明的技术方案进行具体说明。The technical solution of the present invention will be specifically described below in conjunction with the accompanying drawings.

如图1所示,本发明提供了一种基于改进K-means及卷积神经网络的白细胞提取和分类方法,包括如下步骤:As shown in Figure 1, the present invention provides a kind of leukocyte extraction and classification method based on improved K-means and convolutional neural network, comprises the following steps:

步骤S1、在提取白细胞之前,先进行颜色空间的分解,采用有利于白细胞分割的颜色分量;具体如下:Step S1, before extracting white blood cells, decompose the color space first, and use color components that are beneficial to white blood cell segmentation; details are as follows:

步骤S11、建立彩色模型:对白细胞染色处理,使其在色调分量(H)空间和饱和分量(S)空间对应的细胞质区域以及白细胞区域都与背景图像存在着较强的对比度;Step S11, establishing a color model: staining the white blood cells so that the cytoplasmic regions and white blood cell regions corresponding to the hue component (H) space and the saturation component (S) space have a strong contrast with the background image;

步骤S12、在后续分割中,设置饱和分量空间和色调分量空间中阈值,将白细胞的细胞核区域以及白细胞区域粗略地从细胞图像中提取出来;Step S12, in the subsequent segmentation, setting thresholds in the saturation component space and the hue component space, and roughly extracting the nucleus area and the leukocyte area of the leukocyte from the cell image;

步骤S2、根据细胞图像灰度分布对K-means聚类算法进行改进,选定初始聚类中心,使图像中所有像素按就近原则进行初始聚类,并对FWSA-KM算法的欧式距离进行改进,使聚类算法的鲁棒性提高;具体如下:Step S2, improve the K-means clustering algorithm according to the gray distribution of the cell image, select the initial clustering center, make all pixels in the image perform initial clustering according to the nearest principle, and improve the Euclidean distance of the FWSA-KM algorithm , so that the robustness of the clustering algorithm is improved; the details are as follows:

步骤S21、对细胞图像灰度进行直方图分布统计来选定初始聚类中心,使得图像所有像素按就近原则进行初始聚类;Step S21, performing histogram distribution statistics on the gray scale of the cell image to select the initial clustering center, so that all pixels of the image are initially clustered according to the principle of proximity;

步骤S22、基于非欧氏距离,改进特征权重,对K-means进行改进,得到Improved-KM聚类算法,即将该算法目标函数中的χik和vjk的欧氏距离|χik-vjk|修改为非欧氏距离

Figure BDA0002060817180000051

由此得到改进的Improved-KM聚类算法的目标函数为:

Figure BDA0002060817180000052

Step S22, based on the non-Euclidean distance, improve the feature weight, improve K-means, and obtain the Improved-KM clustering algorithm, that is, the Euclidean distance between χ ik and v jk in the algorithm's objective function | χ ik -v jk |Modified to non-Euclidean distance

Figure BDA0002060817180000051

The objective function of the improved Improved-KM clustering algorithm is:

Figure BDA0002060817180000052

所述步骤S22中,Improved-KM聚类算法步骤如下:In the step S22, the steps of the Improved-KM clustering algorithm are as follows:

步骤S221、目标函数:

Figure BDA0002060817180000053

其中U=(uij)n×c是隶属度矩阵;如果第i个数据点xi属于第j个类,则uij=1,否则uij=0,并且

Figure BDA0002060817180000054

而V=[v1,v2,…,vc]是c个聚类中心构成的矩阵;同时式子满足:Step S221, objective function:

Figure BDA0002060817180000053

Where U=(u ij ) n×c is the membership degree matrix; if the i-th data point x i belongs to the j-th class, then u ij =1, otherwise u ij =0, and

Figure BDA0002060817180000054

And V=[v 1 ,v 2 ,…,v c ] is a matrix composed of c cluster centers; at the same time, the formula satisfies:

Figure BDA0002060817180000055

Figure BDA0002060817180000055

步骤S222、最优隶属度矩阵

Figure BDA0002060817180000056

和聚类中心矩阵

Figure BDA0002060817180000057

中的元素为:

Figure BDA0002060817180000058

Figure BDA0002060817180000059

Step S222, optimal membership degree matrix

Figure BDA0002060817180000056

and cluster center matrix

Figure BDA0002060817180000057

The elements in are:

Figure BDA0002060817180000058

and

Figure BDA0002060817180000059

步骤S223、通过迭代求解三个最小化问题;Step S223, solving three minimization problems through iteration;

步骤S224:令

Figure BDA00020608171800000510

Figure BDA00020608171800000511

则ak度量了聚类在第k维特征上总的类内紧致性,bk度量了聚类在第k维特征上总的类间分离性度量;Step S224: make

Figure BDA00020608171800000510

and

Figure BDA00020608171800000511

Then a k measures the total intra-class compactness of the clustering on the k-th dimension feature, and b k measures the total inter-class separation measure of the clustering on the k-th dimension feature;

步骤S225、用新的目标函数

Figure BDA00020608171800000512

求解如下特征权重矩阵:Step S225, using a new objective function

Figure BDA00020608171800000512

Solve the following feature weight matrix:

Figure BDA0002060817180000061

Figure BDA0002060817180000061

其中满足:

Figure BDA0002060817180000062

Which satisfies:

Figure BDA0002060817180000062

步骤S226、设

Figure BDA0002060817180000063

为第t步迭代的特征权重,则下式可表示为第t+1步的特征权重:

Figure BDA0002060817180000064

其中特征权重调节差量如下:Step S226, set

Figure BDA0002060817180000063

is the feature weight of the t-th step iteration, then the following formula can be expressed as the feature weight of the t+1-th step:

Figure BDA0002060817180000064

The feature weight adjustment difference is as follows:

Figure BDA0002060817180000065

Figure BDA0002060817180000065

为了使

Figure BDA0002060817180000066

满足约束条件

Figure BDA0002060817180000067

对特征权重公式进行规范化处理,得到特征权重

Figure BDA0002060817180000068

because

Figure BDA0002060817180000066

meet the constraints

Figure BDA0002060817180000067

Normalize the feature weight formula to get the feature weight

Figure BDA0002060817180000068

步骤S3、采用改进的K-means算法进行细胞核和细胞质的提取;具体如下:Step S3, using the improved K-means algorithm to extract the nucleus and cytoplasm; details are as follows:

步骤S31、白细胞的提取:通过观察不同细胞图像的不同色彩模型的不同分量来选择,再进行聚类分割;Step S31, extraction of white blood cells: select by observing different components of different color models of different cell images, and then perform clustering and segmentation;

步骤S32、细胞核的提取:利用聚类后的二值图像含有大量的噪声且噪声普遍面积小的特点计算出每个连通区域的面积,剔除小于阈值的连通区域面积,填充在细胞核区域之中小的孔洞;Step S32, cell nucleus extraction: calculate the area of each connected region by using the characteristic that the clustered binary image contains a lot of noise and the noise generally has a small area, remove the connected region area smaller than the threshold, and fill in the small ones in the nucleus region holes;

步骤S33、细胞质的提取:因为细胞质的颜色往往和白细胞或者背景图像的颜色相似,所以极难分割;因此采用将提取出的白细胞减去白细胞核的方法来得到细胞质部分,再进行重新恢复彩色像Step S33, cytoplasm extraction: because the color of cytoplasm is often similar to the color of white blood cells or the background image, it is extremely difficult to segment; therefore, the cytoplasm part is obtained by subtracting the white blood cell nucleus from the extracted white blood cells, and then the color image is restored

步骤S4、采用分水岭算法来分离复杂粘连的白细胞部分;具体如下:Step S4, using the watershed algorithm to separate the white blood cells with complex adhesions; the details are as follows:

步骤S41、对于有粘连在一起的白细胞,进行去噪和孔洞填充处理;Step S41, performing denoising and hole filling processing for the white blood cells that have adhered together;

步骤S42、用分水岭分割算法将粘连在一起的白细胞进行分割处理。Step S42, using the watershed segmentation algorithm to segment the cohesive white blood cells.

步骤S5、采用卷积神经网络对提取分离的白细胞进行实验,实现粘连白细胞的识别;如图2所示,所述卷积神经网络由输入层、卷积层、采样层、连接层和输出层构成;输入层输入需要分类的图像,由卷积层提取出相应的特征,为加速学习速度通过采样层下采样减少需要神经元个数同时保留有用信息,连接层通过激活函数(sigmoid函数等)将分类结果输入至输出层,输出层的维度为所需分类的类别数。Step S5, using the convolutional neural network to conduct experiments on the extracted and separated white blood cells to realize the identification of adherent white blood cells; as shown in Figure 2, the convolutional neural network consists of an input layer, a convolutional layer, a sampling layer, a connection layer and an output layer Composition; the input layer inputs the image that needs to be classified, and the corresponding features are extracted by the convolutional layer. In order to accelerate the learning speed, the number of neurons is reduced by downsampling the sampling layer while retaining useful information. The connection layer passes the activation function (sigmoid function, etc.) The classification result is input to the output layer, and the dimension of the output layer is the number of categories to be classified.

以上是本发明的较佳实施例,凡依本发明技术方案所作的改变,所产生的功能作用未超出本发明技术方案的范围时,均属于本发明的保护范围。The above are the preferred embodiments of the present invention, and all changes made according to the technical solution of the present invention, when the functional effect produced does not exceed the scope of the technical solution of the present invention, all belong to the protection scope of the present invention.

Claims (5)

1. A leukocyte extraction and classification method based on improved K-means and a convolutional neural network is characterized by comprising the following steps:

s1, before white blood cells are extracted, decomposing a color space, and adopting color components beneficial to white blood cell segmentation;

s2, improving a K-means clustering algorithm according to the gray level distribution of the cell image, selecting an initial clustering center, initially clustering all pixels in the image according to a nearby principle, and improving the Euclidean distance of the FWSA-KM algorithm;

s3, extracting cell nucleus and cytoplasm by adopting an improved K-means algorithm;

s4, separating the complex adhered white blood cell part by adopting a watershed algorithm;

s5, adopting a convolutional neural network to perform an experiment on the extracted and separated white blood cells to realize the identification of the adhesion white blood cells;

the step S2 is specifically realized as follows:

s21, performing histogram distribution statistics on the gray level of the cell image to select an initial clustering center, so that all pixels of the image are initially clustered according to a principle of proximity;

s22, improving the characteristic weight based on the non-Euclidean distance, and improving the K-means to obtain an Improved-KM clustering algorithm, namely, the chi in the objective function of the algorithm ik And v jk Euclidean distance | χ of ik -v jk I modified to a non-Euclidean distance

Figure FDA0003877486720000011

Improved KM the objective function of the clustering algorithm is:

Figure FDA0003877486720000012

in step S22, the Improved-KM clustering algorithm includes the following steps:

step S221, an objective function:

Figure FDA0003877486720000013

wherein U = (U) ij ) n×c Is a membership matrix; if the ith data point x i Belongs to the jth class, then u ij =1, otherwise u ij Is =0, and

Figure FDA0003877486720000014

and V = [ V ] 1 ,v 2 ,…,v c ]Is a matrix formed by c clustering centers; meanwhile, the formula satisfies:

Figure FDA0003877486720000015

step S222, an optimal membership matrix

Figure FDA0003877486720000016

And cluster center matrix

Figure FDA0003877486720000017

The elements in (A) are as follows:

Figure FDA0003877486720000021

and

Figure FDA0003877486720000022

step S223, three minimization problems are solved through iteration;

step S224: order to

Figure FDA0003877486720000023

And

Figure FDA0003877486720000024

then a is k Measure the total intra-class compactness of clustering on the kth-dimensional features, b k Measuring the total inter-class separation measurement of the clustering on the k-dimension feature;

step S225, using the new objective function

Figure FDA0003877486720000025

Solving the following feature weight matrix:

Figure FDA0003877486720000026

wherein the following are satisfied:

Figure FDA0003877486720000027

step S226, setting

Figure FDA0003877486720000028

For the feature weight of the t-th iteration, the following formula is expressed as the feature weight of the t + 1-th step:

Figure FDA0003877486720000029

wherein the feature weight adjustment dispersion is as follows:

Figure FDA00038774867200000210

to make it possible to

Figure FDA00038774867200000211

Satisfy the constraint condition

Figure FDA00038774867200000212

Carrying out normalization processing on the characteristic weight formula to obtain the characteristic weight

Figure FDA00038774867200000213

2. The method for extracting and classifying leukocytes based on improved K-means and convolutional neural network as claimed in claim 1, wherein the step S1 is implemented as follows:

step S11, establishing a color model: staining white blood cells so that a cytoplasm region and a white blood cell region corresponding to a hue component (H) space and a saturation component (S) space have strong contrast with a background image;

in the subsequent segmentation, thresholds in the saturation component space and the hue component space are set, and the cell nucleus region and the white blood cell region of the white blood cells are roughly extracted from the cell image.

3. The method for extracting and classifying leukocytes based on improved K-means and convolutional neural network as claimed in claim 1, wherein the step S3 is implemented as follows:

step S31, extraction of white blood cells: selecting by observing different components of different color models of different cell images, and then carrying out clustering segmentation;

step S32, extracting cell nucleuses: calculating the area of each connected region by using the characteristics that the clustered binary image contains a large amount of noise and the common area of the noise is small, eliminating the area of the connected region smaller than a threshold value, and filling small holes in a cell nucleus region;

step S33, cytoplasm extraction: the extracted white blood cells are subtracted from the white nucleus to obtain a cytoplasmic fraction, and the color image is restored again.

4. The method for extracting and classifying leukocytes based on improved K-means and convolutional neural network as claimed in claim 1, wherein the step S4 is implemented as follows:

s41, denoising and hole filling processing are carried out on the white blood cells adhered together;

and S42, carrying out segmentation processing on the adhered white blood cells by using a watershed segmentation algorithm.

5. The method for extracting and classifying leukocytes based on improved K-means and convolutional neural network as claimed in claim 1, wherein in step S5, the convolutional neural network is composed of an input layer, a convolutional layer, a sampling layer, a connection layer and an output layer; the input layer inputs images needing to be classified, corresponding features are extracted by the convolutional layer, the number of required neurons is reduced through sampling by the sampling layer for accelerating learning speed, meanwhile, useful information is reserved, the connection layer inputs a classification result to the output layer through an activation function, and the dimensionality of the output layer is the number of classes needing to be classified.

CN201910404623.8A 2019-05-16 2019-05-16 Leukocyte extraction and classification method based on improved K-means and convolutional neural network Expired - Fee Related CN110110807B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910404623.8A CN110110807B (en) 2019-05-16 2019-05-16 Leukocyte extraction and classification method based on improved K-means and convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910404623.8A CN110110807B (en) 2019-05-16 2019-05-16 Leukocyte extraction and classification method based on improved K-means and convolutional neural network

Publications (2)

Publication Number Publication Date
CN110110807A CN110110807A (en) 2019-08-09
CN110110807B true CN110110807B (en) 2023-01-03

Family

ID=67490357

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910404623.8A Expired - Fee Related CN110110807B (en) 2019-05-16 2019-05-16 Leukocyte extraction and classification method based on improved K-means and convolutional neural network

Country Status (1)

Country Link
CN (1) CN110110807B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115019305B (en) * 2022-08-08 2022-11-11 成都西交智汇大数据科技有限公司 Method, device and equipment for identifying root tip cells and readable storage medium
CN115661667B (en) * 2022-12-13 2023-04-07 济宁市土哥农业服务有限公司 Method for identifying impurities in descurainia sophia seeds based on computer vision

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103473739A (en) * 2013-08-15 2013-12-25 华中科技大学 White blood cell image accurate segmentation method and system based on support vector machine
CN104392460A (en) * 2014-12-12 2015-03-04 山东大学 Adherent white blood cell segmentation method based on nucleus-marked watershed transformation
CN104751462A (en) * 2015-03-29 2015-07-01 嘉善加斯戴克医疗器械有限公司 White cell segmentation method based on multi-feature nonlinear combination
CN109034045A (en) * 2018-07-20 2018-12-18 中南大学 A kind of leucocyte automatic identifying method based on convolutional neural networks

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8569356B2 (en) * 2005-10-25 2013-10-29 University Of Florida Research Foundation, Inc. Cyclin dependent kinase inhibitors

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103473739A (en) * 2013-08-15 2013-12-25 华中科技大学 White blood cell image accurate segmentation method and system based on support vector machine
CN104392460A (en) * 2014-12-12 2015-03-04 山东大学 Adherent white blood cell segmentation method based on nucleus-marked watershed transformation
CN104751462A (en) * 2015-03-29 2015-07-01 嘉善加斯戴克医疗器械有限公司 White cell segmentation method based on multi-feature nonlinear combination
CN109034045A (en) * 2018-07-20 2018-12-18 中南大学 A kind of leucocyte automatic identifying method based on convolutional neural networks

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Automatic Extraction of Fuzzy and Touching Leukocyte Using Improved FWSA K-means in Peripheral Blood and Bone Marrow Cell Images;Li-Qun Lin et al.;《JOURNAL OF COMPUTERS》;20190601;第30卷(第03期);第1-13页 *
K-means cluster algorithm based on color image enhancement for cell segmentation;Man Yan et al.;《2012 5th International Conference on BioMedical Engineering and Informatics》;20130506;第295-299页 *
改进的分数阶微分及图论的粘连血细胞图像分割;林丽群 等;《福州大学学报(自然科学版)》;20171205;第45卷(第06期);第794-800页 *

Also Published As

Publication number Publication date
CN110110807A (en) 2019-08-09

Similar Documents

Publication Publication Date Title
Tomari et al. 2014 Computer aided system for red blood cell classification in blood smear image
Sinha et al. 2003 Automation of differential blood count
JP2022137166A (en) 2022-09-21 System and method of classification of biological particles
CN106248559B (en) 2018-10-23 A kind of five sorting technique of leucocyte based on deep learning
Dhieb et al. 2019 An automated blood cells counting and classification framework using mask R-CNN deep learning model
CN109101938B (en) 2021-01-15 Multi-label age estimation method based on convolutional neural network
Othman et al. 2017 Neural network classification of white blood cell using microscopic images
CN110334706A (en) 2019-10-15 A kind of images steganalysis method and device
CN108830237B (en) 2020-08-11 Facial expression recognition method
CN107133612A (en) 2017-09-05 Based on image procossing and the intelligent ward of speech recognition technology and its operation method
Rawat et al. 2015 Review of leukocyte classification techniques for microscopic blood images
CN107330412B (en) 2021-03-26 A face age estimation method based on deep sparse representation
Ji et al. 2019 Research on urine sediment images recognition based on deep learning
Taher et al. 2012 Bayesian classification and artificial neural network methods for lung cancer early diagnosis
CN110110807B (en) 2023-01-03 Leukocyte extraction and classification method based on improved K-means and convolutional neural network
Dong et al. 2021 A self-adaptive approach for white blood cell classification towards point-of-care testing
Soeleman et al. 2012 Adaptive threshold for background subtraction in moving object detection using Fuzzy C-Means clustering
Ahasan et al. 2016 White blood cells nucleus segmentation from microscopic images of strained peripheral blood film during leukemia and normal condition
Pandit et al. 2015 Survey on automatic rbc detection and counting
Mandyartha et al. 2020 Global and adaptive thresholding technique for white blood cell image segmentation
Muhamad et al. 2022 A comparative evaluation of deep learning methods in automated classification of white blood cell images
CN110148126A (en) 2019-08-20 Blood leucocyte dividing method based on color component combination and contour fitting
CN110781828A (en) 2020-02-11 Fatigue state detection method based on micro-expression
Arunachalam 2020 Applications of Machine learning and Image processing techniques in the detection of leukemia
Heriawati et al. 2021 Blood cells classification for identification of acute lymphoblastic leukemia on microscopic images using image processing

Legal Events

Date Code Title Description
2019-08-09 PB01 Publication
2019-08-09 PB01 Publication
2019-09-03 SE01 Entry into force of request for substantive examination
2019-09-03 SE01 Entry into force of request for substantive examination
2023-01-03 GR01 Patent grant
2023-01-03 GR01 Patent grant
2024-05-24 CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20230103

2024-05-24 CF01 Termination of patent right due to non-payment of annual fee