Title: A mode-partitioned gamma mixture model estimation method for large-scale multimodal data
Authors: Chen, Jiaqi; Xu, Yuyang; Xu Li; Dr. AZHAR Muhammad
Date: 2025-12-22
Year: 2026
Citation: Big Data Mining and Analytics, 2026, vol. 9(1), pp. 4-22.
ISSN: 2096-0654; 2097-406X
URI: http://hdl.handle.net/20.500.11861/26321
DOI: 10.26599/BDMA.2025.9020045
Access: Open access
Language: en
Type: Peer Reviewed Journal Article

Abstract:
The Gamma Mixture Model (GaMM) is a useful tool for representing complex distributions. However, estimating the parameters of a GaMM is challenging because the shape parameter has no closed-form solution. Existing parameter estimation methods are limited by their reliance on approximate computations, which degrade estimation accuracy, and by the inherent complexity of numerical calculations, which leads to computational inefficiency. To address these limitations and fully consider the multimodal nature of big data, this paper proposes a Mode-Partitioned GaMM (MP-GaMM) estimation method for large-scale multimodal data. The MP-GaMM method explores the spatial distribution characteristics of the data through clustering to partition the data into distinct modes, addresses mode overlap with a tune-up strategy, and employs a closed-form estimator to estimate the parameters of each mode in parallel. Experimental results demonstrate the rationality and effectiveness of the proposed MP-GaMM method, which outperforms existing methods in both accuracy and computational efficiency. Specifically, MP-GaMM exhibits lower error metrics, higher log-likelihood values, and shorter runtime, indicating its capability to provide a more accurate estimation of the model parameters and a more precise characterization of the multimodal nature of large-scale data.

Keywords: Estimation Method; Mixture Model; Data Methods; Large-scale Data; Parameter Estimates; Computational Efficiency; Numerical Calculations; Distinct Modes; Shape Parameter; Approximate Computation; Closed-Form Approximation; Data Distribution; Bayesian Inference; Maximum Likelihood Estimation; Bimodal; Clustering Algorithm; Probability Density Function; Markov Chain Monte Carlo; Expectation Maximization; Gamma Distribution; Large-Scale Datasets; Large-Scale Distribution; Gaussian Copula; Synthetic Aperture Radar; Distribution Of Dataset; Underlying Data Distribution; Probability Density Estimation; Negative Log-Likelihood; Real-World Datasets; Ratio Threshold
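
Illustrative sketch: the abstract describes fitting each partitioned mode with a closed-form estimator, but this record does not say which estimator the paper uses. The example below assumes one well-known closed-form choice for the gamma distribution (shape and scale computed directly from sums of x, ln x, and x ln x, with no iterative solver). The function names `gamma_closed_form` and `estimate_modes` are hypothetical, and the clustering and tune-up steps are not reproduced: the partitions are taken as given.

```python
import math
import random


def gamma_closed_form(xs):
    """Closed-form shape/scale estimates for positive samples.

    Assumed estimator (not confirmed as the paper's):
      shape = n * sum(x) / (n * sum(x ln x) - sum(ln x) * sum(x))
      scale = (n * sum(x ln x) - sum(ln x) * sum(x)) / n^2
    """
    n = len(xs)
    sum_x = sum(xs)
    sum_log = sum(math.log(x) for x in xs)
    sum_xlog = sum(x * math.log(x) for x in xs)
    denom = n * sum_xlog - sum_log * sum_x  # positive unless all x are equal
    return n * sum_x / denom, denom / (n * n)


def estimate_modes(partitions):
    """Fit each pre-partitioned mode independently.

    Returns one (weight, shape, scale) tuple per mode, where the weight
    is the fraction of all points that fell in that partition.
    """
    total = sum(len(p) for p in partitions)
    return [(len(p) / total, *gamma_closed_form(p)) for p in partitions]


if __name__ == "__main__":
    random.seed(0)
    # Two synthetic, well-separated modes standing in for clustered data.
    mode_a = [random.gammavariate(3.0, 2.0) for _ in range(20000)]
    mode_b = [random.gammavariate(50.0, 1.0) for _ in range(10000)]
    for weight, shape, scale in estimate_modes([mode_a, mode_b]):
        print(f"weight={weight:.3f} shape={shape:.3f} scale={scale:.3f}")
```

Because each partition is fitted independently, the per-mode calls could be distributed across worker processes, which is consistent with the parallel estimation the abstract mentions.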