Fast adaptively balanced min-cut clustering
#0.论文信息
- 标题:Fast adaptively balanced min-cut clustering
- 期刊:Pattern Recognition
- 作者: Feiping Nie , Fangyuan Xie , Jingyu Wang ,Xuelong Li
- 机构: China Telecom, Northwestern Polytechnic al University.
- 代码链接:
#1.摘要
最小切割聚类是一种典型的图聚类方法,已厂泛应用于模式识别、数据分析和图像处理。然而,最小切割有平凡解,这导致了倾斜的聚类性能。谱聚类(SC)通过将指标矩阵放松为连续嵌入,然后离散嵌入来缓解这一问题。然而,SC存在两个主要的挑战,即高计算复杂度和两阶段过程。为了解决这些问题,本文提出了一种快速自适应平衡最小割聚类模型(FBMC),该模型直接求解离散指标矩阵,不需要任何后处理。我们利用二部图来加速亲和图的构造和优化过程。此外,在模型中加入了平衡因子,可以缓解聚类结果的偏差。提出了两种具体的方法,一种在所有集群中添加一个平衡因子,另一种将平衡因子分别分配给每个集群,称为FBMC1和FBMC2。此外,对FBMC1和FBMC2提出了一步优化问题,其复杂度为线性的。
#2.实验结果
Table3 ACC±Standard deviation(%)ofcompared methods on benchmark datasets.
Table 4 NMI± Standard deviation(%)of compared methods on benchmark datasets
Fig.3.ACCvaries with No.of anchor points and neighbors for F BMC 2.(a)CAL16.(b)CAL28.(c) COIL100.(d) Connect4.(e)ISO5.(f)PAL25.(g)Protein.(h)YaleB.
Fig.1.The change of objective function valuewith the number of iteration.(a)COIL20.(b)COIL20_0.01.©MSRA25.(d)MSRA25_0.01.
Table1 Comparison for S BMC,EBM Can dour proposed method
#3.主要贡献
-我们提出了一种快速自适应平衡的最小切割聚类,旨在最大限度地提高簇内相似性,可以作为最小切割聚类的替代方法。该方法采用了锚定图,从而降低了计算的复杂度。
-通过引入平衡因子,所提出的模型考虑了聚类的平衡,从而避免了最小切割聚类中的平凡解问题。平衡因子的计算是自适应的,模型是无参数的。
-提出了一种单步优化方法来解决优化问题。此外,在我们提出的模型中,离散指标矩阵直接通过坐标下降法求解,不需要任何后处理操作,其复杂度为线性时间。
#4.方法
3.Proposed methodology The proposed unified framework is illustrated as follows min F ∈ I n d , A ∈ D i a g ∥ B T B − F A F T ∥ F 2 \operatorname*{min}_{F\in\mathrm{Ind},A\in D i a g}\left\|B^{T}B-F A F^{T}\right\|_{F}^{2} minF∈Ind,A∈Diag BTB−FAFT F2
where A = d i a g { λ 1 , λ 2 , … , λ c } , λ j > 0 , j = 1 , 2 , … , c . A {\cal{A}}\,=\,d i a g\{\lambda_{1},\lambda_{2},\ldots,\lambda_{c}\},\lambda_{j}\,>\,0,j\,=\,1,2,\ldots,c.\ {\cal{A}} A=diag{λ1,λ2,…,λc},λj>0,j=1,2,…,c. A is a diagonal matrix composed of balanced factors.When Λ = λ I c \varLambda\;=\;\lambda\pmb{I}_{c} Λ=λIc ,theproblem degradesto
min F ∈ I n d , λ > 0 ∥ B T B − λ F F T ∥ F 2 . \operatorname*{min}_{F\in\mathrm{Ind},\lambda>0}\left\|B^{T}B-\lambda F F^{T}\right\|_{F}^{2}. F∈Ind,λ>0min BTB−λFFT F2.
When ∃ i ≠ j , λ i ≠ λ j , \exists i\neq j,\lambda_{i}\neq\lambda_{j}, ∃i=j,λi=λj, the optimization problem is
min F ∈ I n d , A ∈ D i a g , ∃ i ≠ j , λ i ≠ λ j ∥ B T B − F A F T ∥ F 2 . \operatorname*{min}_{F\in\mathrm{Ind},A\in D i a g,\exists i\neq j,\lambda_{i}\neq\lambda_{j}}\left\|B^{T}B-F A F^{T}\right\|_{F}^{2}. F∈Ind,A∈Diag,∃i=j,λi=λjmin BTB−FAFT F2.
Through the adaptive balanced factors in problem(9)and(10),the final clustering results would achieve high within-cluster similarity and avoid unbalanced clustering outcomes.Infact, A A A could be extended to any symmetric matrix,resulting in the following problem
min F ∈ I n d , A = A T ∥ B T B − F A F T ∥ F 2 . \operatorname*{min}_{F\in\mathrm{Ind},A=A^{T}}\left\|B^{T}B-F A F^{T}\right\|_{F}^{2}. F∈Ind,A=ATmin BTB−FAFT F2.
Problem(9)could be formulated as the following form and we denote the objective function as J 1 J_{1} J1
max J 1 ( F , λ ) = max F ∈ I n d , λ > 0 2 λ Tr ( F T B T B F ) − λ 2 ∥ F ∥ e . \operatorname*{max}J_{1}(F,\lambda)=\operatorname*{max}_{F\in\mathrm{Ind},\lambda>0}2\lambda\operatorname{Tr}\left(F^{T}B^{T}B F\right)-\lambda^{2}\|F\|_{e}. maxJ1(F,λ)=F∈Ind,λ>0max2λTr(FTBTBF)−λ2∥F∥e.
By taking the partial derivative of λ \lambda λ the expression for the optimal valueof λ \lambda λ can be obtained as follows
λ = T r ( F T B T B F ) T r ( F T 1 n 1 n T F ) . \lambda=\frac{\mathrm{Tr}\left(F^{T}B^{T}B F\right)}{\mathrm{Tr}\left(F^{T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}F\right)}. λ=Tr(FT1n1nTF)Tr(FTBTBF).
From the expression of λ \lambda λ it can be seen that the higher the similarity within the clusters and the more balanced the clustering results,the smaller the value of λ \lambda λ From problem(4),a small value of λ \lambda λ indicatesa bigvalue of γ \gamma γ which shows that the balancing term plays a more sig n if i cant role.By substituting the expression of λ \lambda λ in J 1 J_{1} J1 the formulation of J 1 J_{1} J1 becomes
ax J 1 ( F ) = max F ∈ I n d ( Tr ( F T B T B F ) ) 2 Tr ( F T 1 n 1 n T F ) = max F ∈ I n d ( ∑ j = 1 c f j T B T B f j ) 2 ∑ j = 1 c f j T 1 n 1 n T f j . \operatorname{ax}J_{1}(F)=\operatorname*{max}_{F\in\mathrm{Ind}}{\frac{\left(\operatorname{Tr}\left(F^{T}B^{T}B F\right)\right)^{2}}{\operatorname{Tr}\left(F^{T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}F\right)}}=\operatorname*{max}_{F\in\mathrm{Ind}}{\frac{\left(\sum_{j=1}^{c}\mathbf{f}_{j}^{T}B^{T}B\mathbf{f}_{j}\right)^{2}}{\sum_{j=1}^{c}\mathbf{f}_{j}^{T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{j}}}. axJ1(F)=F∈IndmaxTr(FT1n1nTF)(Tr(FTBTBF))2=F∈Indmax∑j=1cfjT1n1nTfj(∑j=1cfjTBTBfj)2.
For F ( 0 ) F^{(0)} F(0) 1, f i p ( 0 ) = 1 f_{i p}^{(0)}=1 fip(0)=1 While as for F ( l ) F^{(l)} F(l) f i l ( l ) = 1 f_{i l}^{(l)}=1 fil(l)=1 Thus, when l ≠ p l\neq p l=p , the pth and /th column are different for F ( 0 ) F^{(0)} F(0) and F ( l ) F^{(l)} F(l) When ,they are l = p l=p l=p the same.Thus,the calculation of J 1 ( F ( l ) ) J_{1}(F^{(l)}) J1(F(l)) is discussed in two cases.
Case1:When l ≠ p l\neq p l=p thevalueof J 1 ( F ( l ) ) J_{1}(F^{(l)}) J1(F(l)) is
J 1 ( F ( l ) ) = ( ∑ j = 1 c f j ( l ) T B T B f j ( l ) ) 2 ∑ j = 1 c f j ( l ) T 1 n 1 n T f j ( l ) . J_{1}(F^{(l)})=\frac{\left(\sum_{j=1}^{c}\mathbf{f}_{j}^{(l)T}B^{T}B\mathbf{f}_{j}^{(l)}\right)^{2}}{\sum_{j=1}^{c}\mathbf{f}_{j}^{(l)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{j}^{(l)}}. J1(F(l))=∑j=1cfj(l)T1n1nTfj(l)(∑j=1cfj(l)TBTBfj(l))2.
Wedefine Ω = { j ∣ j = 1 , 2 , … , c } \varOmega\,=\,\{j|j\,=\,1,2,\dotsc,c\} Ω={j∣j=1,2,…,c} and Ω ( l ) = { j ∣ j = 1 , 2 , … , c , j ≠ \Omega^{(l)}\,=\,\{j|j\,=\,1,2,\dots,c,j\,\neq\, Ω(l)={j∣j=1,2,…,c,j= toexpress more clearly.Since the columns of F ( 0 ) F^{(0)} F(0) and F ( l ) F^{(l)} F(l) are l , j ≠ p } l,j\neq p\} l,j=p} the same expect/th and pth column, J 1 ( F ( l ) ) J_{1}(F^{(l)}) J1(F(l)) can also be written as
( ∑ j ∈ Ω ( l ) f j ( 0 ) T B T B f j ( 0 ) + f l ( l ) T B T B f l ( l ) + f p ( l ) T B T B f p ( l ) ) 2 ∑ j ∈ Ω ( l ) f j ( 0 ) T 1 n 1 n T f j ( 0 ) + f l ( l ) T 1 n 1 n T f l ( l ) + f p ( l ) T 1 n 1 n T f p ( l ) . \frac{\left(\sum_{j\in\Omega^{(l)}}\mathbf{f}_{j}^{(0)T}B^{T}B\mathbf{f}_{j}^{(0)}+\mathbf{f}_{l}^{(l)T}B^{T}B\mathbf{f}_{l}^{(l)}+\mathbf{f}_{p}^{(l)T}B^{T}B\mathbf{f}_{p}^{(l)}\right)^{2}}{\sum_{j\in\Omega^{(l)}}\mathbf{f}_{j}^{(0)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{j}^{(0)}+\mathbf{f}_{l}^{(l)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(l)}+\mathbf{f}_{p}^{(l)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{p}^{(l)}}. ∑j∈Ω(l)fj(0)T1n1nTfj(0)+fl(l)T1n1nTfl(l)+fp(l)T1n1nTfp(l)(∑j∈Ω(l)fj(0)TBTBfj(0)+fl(l)TBTBfl(l)+fp(l)TBTBfp(l))2.
By observing f l ( l ) \mathbf{f}_{l}^{(l)} fl(l) and f l ( 0 ) \mathbf{f}_{l}^{(0)} fl(0) ,itisfound that theyonly differ in the it h element, that is f i l ( l ) = 1 \mathbf{f}_{i l}^{(l)}=1 fil(l)=1 and f i l ( 0 ) = 0 \mathbf{f}_{i l}^{(0)}=0 fil(0)=0 . Thus, f l ( l ) = f l ( 0 ) + e i , \mathbf{f}_{l}^{(l)}=\mathbf{f}_{l}^{(0)}+\mathbf{e}_{i}, fl(l)=fl(0)+ei, where e i \mathbf{e}_{i} ei is a column vector with its i t h i\mathbf{th} ith elementas 1 ^{1} 1 and others a sO.Then,for f l ( l ) T B T B f l ( l ) \mathbf{f}_{l}^{(l)T}B^{T}B\mathbf{f}_{l}^{(l)} fl(l)TBTBfl(l) and f l ( l ) T 1 n 1 n T f l ( l ) \mathbf{f}_{l}^{(l)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(l)} fl(l)T1n1nTfl(l) wehave
f l ( l ) T B T B f l ( l ) = ( f l ( 0 ) + e i ) T B T B ( f l ( 0 ) + e i ) = f l ( 0 ) T B T B f l ( 0 ) + 2 b i T B f l ( 0 ) + b i T b i . f l ( l ) T 1 n 1 n T f l ( l ) = ( f l ( 0 ) + e i ) T 1 n 1 n T ( f l ( 0 ) + e i ) = f l ( 0 ) T 1 n 1 n T f l ( 0 ) + 2 1 n T f l ( 0 ) + 1. \begin{array}{r l}&{\mathbf{f}_{l}^{(l)T}B^{T}B\mathbf{f}_{l}^{(l)}=\left(\mathbf{f}_{l}^{(0)}+\mathbf{e}_{i}\right)^{T}B^{T}B\left(\mathbf{f}_{l}^{(0)}+\mathbf{e}_{i}\right)}\\ &{\qquad\qquad\qquad\qquad=\mathbf{f}_{l}^{(0)T}B^{T}B\mathbf{f}_{l}^{(0)}+2\mathbf{b}_{i}^{T}B\mathbf{f}_{l}^{(0)}+\mathbf{b}_{i}^{T}\mathbf{b}_{i}.}\\ &{\mathbf{f}_{l}^{(l)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(l)}=\left(\mathbf{f}_{l}^{(0)}+\mathbf{e}_{i}\right)^{T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\left(\mathbf{f}_{l}^{(0)}+\mathbf{e}_{i}\right)}\\ &{\qquad\qquad\qquad\qquad\qquad=\mathbf{f}_{l}^{(0)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(0)}+2\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(0)}+1.}\end{array} fl(l)TBTBfl(l)=(fl(0)+ei)TBTB(fl(0)+ei)=fl(0)TBTBfl(0)+2biTBfl(0)+biTbi.fl(l)T1n1nTfl(l)=(fl(0)+ei)T1n1nT(fl(0)+ei)=fl(0)T1n1nTfl(0)+21nTfl(0)+1.
f p ( I ) T B T B f p ( I ) = ( f p ( 0 ) − e i ) T B T B ( f p ( 0 ) − e i ) = f p ( 0 ) T B T B f p ( 0 ) − 2 b i T B f p ( 0 ) + b i T b i , f p ( I ) T 1 n 1 n T f p ( I ) = ( f p ( 0 ) − e i ) T 1 n 1 n T ( f p ( 0 ) − e i ) = f p ( 0 ) T 1 n 1 n T f p ( 0 ) − 2 1 n T f p ( 0 ) + 1. \begin{array}{r l}&{\mathbf{f}_{p}^{(I)T}B^{T}B\mathbf{f}_{p}^{(I)}=\left(\mathbf{f}_{p}^{(0)}-\mathbf{e}_{i}\right)^{T}B^{T}B\left(\mathbf{f}_{p}^{(0)}-\mathbf{e}_{i}\right)}\\ &{\qquad\qquad\qquad\qquad=\mathbf{f}_{p}^{(0)T}B^{T}B\mathbf{f}_{p}^{(0)}-2\mathbf{b}_{i}^{T}B\mathbf{f}_{p}^{(0)}+\mathbf{b}_{i}^{T}\mathbf{b}_{i},}\\ &{\mathbf{f}_{p}^{(I)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{p}^{(I)}=\left(\mathbf{f}_{p}^{(0)}-\mathbf{e}_{i}\right)^{T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\left(\mathbf{f}_{p}^{(0)}-\mathbf{e}_{i}\right)}\\ &{\qquad\qquad\qquad\qquad=\mathbf{f}_{p}^{(0)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{p}^{(0)}-2\mathbf{1}_{n}^{T}\mathbf{f}_{p}^{(0)}+1.}\end{array} fp(I)TBTBfp(I)=(fp(0)−ei)TBTB(fp(0)−ei)=fp(0)TBTBfp(0)−2biTBfp(0)+biTbi,fp(I)T1n1nTfp(I)=(fp(0)−ei)T1n1nT(fp(0)−ei)=fp(0)T1n1nTfp(0)−21nTfp(0)+1.
TakeEqs.(18)-(21) intoEq.(17), J 1 ( F ( l ) ) J_{1}(F^{(l)}) J1(F(l)) becomes
( ∑ j ∈ Ω f j ( 0 ) T B T B f j ( 0 ) + 2 b i T B f l ( 0 ) − 2 b i T B f p ( 0 ) + 2 b i T b i ) 2 ∑ j ∈ Ω f j ( 0 ) T 1 n 1 n T f j ( 0 ) + 2 1 n T f l ( 0 ) − 2 1 n T f p ( 0 ) + 2 . \frac{\left(\sum_{j\in\Omega}\mathbf{f}_{j}^{(0)T}\mathbf{B}^{T}\mathbf{B}\mathbf{f}_{j}^{(0)}+2\mathbf{b}_{i}^{T}\mathbf{B}\mathbf{f}_{l}^{(0)}-2\mathbf{b}_{i}^{T}\mathbf{B}\mathbf{f}_{p}^{(0)}+2\mathbf{b}_{i}^{T}\mathbf{b}_{i}\right)^{2}}{\sum_{j\in\Omega}\mathbf{f}_{j}^{(0)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}\mathbf{f}_{j}^{(0)}+2\mathbf{1}_{n}^{T}\mathbf{f}_{l}^{(0)}-2\mathbf{1}_{n}^{T}\mathbf{f}_{p}^{(0)}+2}. ∑j∈Ωfj(0)T1n1nTfj(0)+21nTfl(0)−21nTfp(0)+2(∑j∈Ωfj(0)TBTBfj(0)+2biTBfl(0)−2biTBfp(0)+2biTbi)2.
Case2:When l = p l=p l=p the value of objective function is
J 1 ( F ( l ) ) = ( T r ( F ( 0 ) T B T B F ( 0 ) ) ) 2 T r ( F ( 0 ) T 1 n 1 n T F ( 0 ) ) . J_{1}(F^{(l)})=\frac{\left(\mathrm{Tr}\left(F^{(0)T}B^{T}B F^{(0)}\right)\right)^{2}}{\mathrm{Tr}\left(F^{(0)T}{\bf1}_{n}{\bf1}_{n}^{T}F^{(0)}\right)}. J1(F(l))=Tr(F(0)T1n1nTF(0))(Tr(F(0)TBTBF(0)))2.
We denote u as the value of objective function when we update it h row and I t h I{\mathrm{th}} Ith column. Considering that when l ≠ p l\neq p l=p , f i l ( 0 ) = 0 f_{i l}^{(0)}=0 fil(0)=0 and when l = p l=p l=p thevalueof f i l f_{i l} fil is 1 ^{1} 1 U can be written in a unified form:
( T r ( F ( 0 ) T B T B F ( 0 ) ) + 2 b i T ( 1 − f i l ( 0 ) ) ( B f l ( 0 ) − B f p ( 0 ) + b i ) ) 2 T r ( F ( 0 ) T 1 n 1 n T F ( 0 ) ) + 2 ( 1 − f i l ( 0 ) ) ( 1 n T f l ( 0 ) − 1 n T f p ( 0 ) + 1 ) . \frac{\left(\mathrm{Tr}(F^{(0)T}B^{T}B F^{(0)})+2{\bf b}_{i}^{T}(1-f_{i l}^{(0)})(B{\bf f}_{l}^{(0)}-B{\bf f}_{p}^{(0)}+{\bf b}_{i})\right)^{2}}{\mathrm{Tr}\left(F^{(0)T}{\bf1}_{n}{\bf1}_{n}^{T}F^{(0)}\right)+2(1-f_{i l}^{(0)})({\bf1}_{n}^{T}{\bf f}_{l}^{(0)}-{\bf1}_{n}^{T}{\bf f}_{p}^{(0)}+1)}. Tr(F(0)T1n1nTF(0))+2(1−fil(0))(1nTfl(0)−1nTfp(0)+1)(Tr(F(0)TBTBF(0))+2biT(1−fil(0))(Bfl(0)−Bfp(0)+bi))2.
From formula(24),there are five elements needed to be stored in advance,whichare T r ( F ( 0 ) I B I B F ( 0 ) ) , T r ( F ( 0 ) I 1 n 1 n T F ( 0 ) ) , B F ( 0 ) , 1 n T F ( 0 ) \mathrm{Tr}(F^{(0)I}B^{I}B F^{(0)}),\,\mathrm{Tr}(F^{(0)I}1_{n}1_{n}^{T}F^{(0)}),\,B F^{(0)},\,1_{n}^{T}F^{(0)} Tr(F(0)IBIBF(0)),Tr(F(0)I1n1nTF(0)),BF(0),1nTF(0) and b i T b i , i = 1 , 2 , … , n . \mathbf{b}_{i}^{T}\mathbf{b}_{i},i=1,2,\ldots,n. biTbi,i=1,2,…,n. Since b i T b i , i = 1 , 2 , … , n \mathbf{b}_{i}^{T}\mathbf{b}_{i},i=1,2,\ldots,n biTbi,i=1,2,…,n is not changed at all there is no need to update it.After obtaining the new f ι \mathbf{f}^{\iota} fι ,otherfour elements do not need to update as well if q = p q\ =\ p q = p However,if q ≠ p q\neq p q=p the optimal F ( q ) F^{(q)} F(q) would replace F ( 0 ) F^{(0)} F(0) and these four elements should be updated for the update of next row.Based on the above analysis we do not need tore calculate the values of these elements,which can be updated by pre-stored variables.The updating formulations are exhibited below
T r ( F ( 0 ) T B T B F ( 0 ) ) ← T r ( F ( 0 ) T B T B F ( 0 ) ) + 2 b i T B f q ( 0 ) − 2 b i T B F p ( 0 ) + 2 b i T b i T r ( F ( 0 ) T 1 n T F ( 0 ) ) ← T r ( F ( 0 ) T 1 n T F ( 0 ) ) + 2 1 n T f q ( 0 ) − 2 1 n T F p ( 0 ) + 2 B F ( 0 ) ( : , q ) ← B F ( 0 ) ( : , q ) + B ( : , i ) B F ( 0 ) ( : , p ) + B F ( 0 ) ( : , p ) − B ( : , i ) 1 n T F ( 0 ) ( q ) ← 1 n T F ( 0 ) ( q ) + 1 1 n T F ( 0 ) ( p ) ← 1 n T F ( 0 ) ( p ) − 1 \begin{array}{r l}&{\mathrm{Tr}(F^{(0)T}B^{T}B F^{(0)})\gets\mathrm{Tr}(F^{(0)T}B^{T}B F^{(0)})+2\mathbf{b}_{i}^{T}B\mathbf{f}_{q}^{(0)}}\\ &{\qquad\qquad\qquad\qquad-2\mathbf{b}_{i}^{T}B F_{p}^{(0)}+2\mathbf{b}_{i}^{T}\mathbf{b}_{i}}\\ &{\mathrm{Tr}(F^{(0)T}\mathbf{1}_{n}^{T}F^{(0)})\gets\mathrm{Tr}(F^{(0)T}\mathbf{1}_{n}^{T}F^{(0)})+2\mathbf{1}_{n}^{T}\mathbf{f}_{q}^{(0)}}\\ &{\qquad\qquad\qquad\qquad\qquad-2\mathbf{1}_{n}^{T}F_{p}^{(0)}+2}\\ &{B F^{(0)}(:,q)\gets B F^{(0)}(:,q)+B(:,i)}\\ &{B F^{(0)}(:,p)+B F^{(0)}(:,p)-B(:,i)}\\ &{\mathbf{1}_{n}^{T}F^{(0)}(q)\gets\mathbf{1}_{n}^{T}F^{(0)}(q)+1}\\ &{\mathbf{1}_{n}^{T}F^{(0)}(p)\gets\mathbf{1}_{n}^{T}F^{(0)}(p)-1}\end{array} Tr(F(0)TBTBF(0))←Tr(F(0)TBTBF(0))+2biTBfq(0)−2biTBFp(0)+2biTbiTr(F(0)T1nTF(0))←Tr(F(0)T1nTF(0))+21nTfq(0)−21nTFp(0)+2BF(0)(:,q)←BF(0)(:,q)+B(:,i)BF(0)(:,p)+BF(0)(:,p)−B(:,i)1nTF(0)(q)←1nTF(0)(q)+11nTF(0)(p)←1nTF(0)(p)−1
It is worth noting that when updating T r ( F ( 0 ) T B T B F ( 0 ) ) \mathrm{Tr}({\cal F}^{(0)T}B^{T}B{\cal F}^{(0)}) Tr(F(0)TBTBF(0)) and T r ( F ( 0 ) T 1 n 1 n T F ( 0 ) ) \mathrm{Tr}(F^{(0)T}\mathbf{1}_{n}\mathbf{1}_{n}^{T}F^{(0)}) Tr(F(0)T1n1nTF(0)) , several elements such as B f l ( 0 ) B\mathbf{f}_{l}^{(0)} Bfl(0) , B f p ( 0 ) B\mathbf{f}_{p}^{(0)} Bfp(0) , 1 n 1 n T \mathbf{1}_{n}\mathbf{1}_{n}^{T} 1n1nT and 1 n T f p \mathbf{1}_{n}^{T}\mathbf{f}_{p} 1nTfp can betaken from the corresponding values in B F ( 0 ) B F^{(0)} BF(0) and 1 n T F ( 0 ) \mathbf{1}_{n}^{T}F^{(0)} 1nTF(0) directly.
#5.总结&限制性
在本文中,我们提出了一种快速目适应平衡最小切割(FBMC)聚类方法,旨在最大化聚类内的相似性和平衡聚类结果。由SBMC和EBMC提出了两种具体的方法,分别命名为FBMC1和FBMC2。与最小切割聚类相比,FBMC没有平凡的解,且复杂度随时间呈线性关系。此外,FBMC1和FBMC2的优化算法是一步一步的,没有额外的超参数。而离散指标矩阵则可以用CD法直接求解,而不需要进行任何后处理。综合实验结果表明了FBMC1和FBMC2的优越性。然而,当在FBMC1和FBMC2中构造二部图时,需要邻居的数量作为一个已知的参数。快速准确地确定该参数是今后研究的一个方向。此外,在这两种方法中,在构造二部图后,仍然需要构造一个全连通图,这可能是不必要的。在未来,我们将关注的重点是从二部图中直接挖掘出聚类信息。