Support Vector Machine Classification Based on Fuzzy Clustering for Large Data Sets

Cervantes, Jair; Li, Xiaoou; Yu, Wen

doi:10.1007/11925231_54

Jair Cervantes²⁰,
Xiaoou Li²⁰ &
Wen Yu²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4293))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

1135 Accesses

Abstract

Support vector machine (SVM) has been successfully applied to solve a large number of classification problems. Despite its good theoretic foundations and good capability of generalization, it is a big challenging task for the large data sets due to the training complexity, high memory requirements and slow convergence. In this paper, we present a new method, SVM classification based on fuzzy clustering. Before applying SVM we use fuzzy clustering, in this stage the optimal number of clusters are not needed in order to have less computational cost. We only need to partition the training data set briefly. The SVM classification is realized with the center of the groups. Then the de-clustering and SVM classification via reduced data are used. The proposed approach is scalable to large data sets with high classification accuracy and fast convergence speed. Empirical studies show that the proposed approach achieves good performance for large data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 239.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A New Multi-class Fuzzy Support Vector Machine Algorithm

Support vector-based fuzzy classifier with adaptive kernel

Article 04 September 2017

Very large-scale data classification based on K-means clustering and multi-kernel SVM

Article 29 January 2018

References

Awad, M., Khan, L., Bastani, F., Yen, I.L.: An Effective support vector machine (SVMs) Performance Using Hierarchical Clustering. In: Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2004), November 15-17, 2004, vol. 00, pp. 663–667 (2004)
Google Scholar
Balcazar, J.L., Dai, Y., Watanabe, O.: Provably Fast Training Algorithms for support vector machine. In: Proc. of the 1st IEEE Int. Conf. on Data Mining, pp. 43–50. IEEE Computer Society, Los Alamitos (2001)
Chapter Google Scholar
Chih-Chung, C., Chih-Jen, L.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Collobert, R., Bengio, S.: SVMTorch: support vector machine for large regresion problems. Journal of Machine Learning Research 1, 143–160 (2001)
Article MathSciNet Google Scholar
Daniael, B., Cao, D.: Training support vector machine Using Adaptive Clustering. In: Proc. of SIAM Int. Conf on Data Mining 2004, Lake Buena Vista, FL, USA (2004)
Google Scholar
Joachims, T.: Making large-scale support vector machine learning practical. In: Scholkopf, A.S.B., Burges, C. (eds.) Advances in Kernel Methods: support vector machine, MIT Press, Cambridge (1998)
Google Scholar
Kim, S.W., Oommen, B.J.: Enhancing prototype reduction schemes with recursion: a method applicable for ”large” data sets. IEEE Transactions on Systems, Man and Cybernetics, Part B 34(3), 1384–1397 (2004)
Article Google Scholar
Lebrun, G., Charrier, C., Cardot, H.: SVM training time reduction using vector quantization. In: Proceedings of the 17th International Conference on pattern recognition, vol. 1, pp. 160–163 (2004)
Google Scholar
Li, K., Huang, H.K.: Incremental learning proximal support vector machine classifiers. In: Proceedings. 2002 International Conference on machine learning and cybernetics, vol. 3, pp. 1635–1637 (2002)
Google Scholar
Luo, F., Khan, L., Bastani, F., Yen, I., Zhou, J.: A Dynamical Growing Self Organizing Tree (DGSOT) for Hierarquical Clustering Gene Expression Profiles. Bioinformatics 20(16), 2605–2617 (2004)
Article Google Scholar
Pavlov, D., Mao, J., Dom, B.: Scaling-up support vector machine using boosting algorithm. In: Proceedings of 15th International Conference on pattern recognition, vol. 2, pp. 219–222 (2000)
Google Scholar
Platt, J.: Fast Training of support vector machine using sequential minimal optimization. In: Scholkopf, A.S.B., Burges, C. (eds.) Advances in Kernel Methods: support vector machine, MIT Press, Cambridge (1998)
Google Scholar
Xu, R., Wunsch II, D.: Survey of clustering algorithms. IEEE Transactions on Neural Networks 16(3), 645–678 (2005)
Article Google Scholar
Schohn, G., Cohn, D.: Less is more: Active Learning with support vector machine. In: Proc. 17th Int. Conf. Machine Learning, Stanford, CA (2000)
Google Scholar
Shih, L., Rennie, D.M., Chang, Y., Karger, D.R.: Text Bundling: Statistics-based Data Reduction. In: Proc of the Twentieth Int. Conf. on Machine Learning (ICML-2003), Washington (2003)
Google Scholar
Tong, S., Koller, D.: Support vector machine active learning with applications to text clasifications. In: Proc. 17th Int. Conf. Machine Learning, Stanford (2000)
Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
MATH Google Scholar
Van Gestel, T., Suykens, J.A.K., De Moor, B., Vandewalle, J.: Bayesian inference for LS-SVMs on large data sets using the Nystrom method. In: Proceedings of the 2002 International Joint Conference on neural networks, vol. 3, pp. 2779–2784 (2002)
Google Scholar
Yu, H., Yang, J., Jiawei, H.: Classifying Large Data Sets Using SVMs with Hierarchical Clusters. In: Proc. of the 9th ACM SIGKDD 2003, August 24-27, 2003, Washington (2003)
Google Scholar
Xu, R., Wunsch, D.: Survey of Clustering Algorithms. IEEE Trans. Neural Networks 16, 645–678 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Sección de Computación Departamento de Ingenierá Elétrica, CINVESTAV-IPN, A.P. 14-740, Av.IPN 2508, México D.F., 07360, México
Jair Cervantes & Xiaoou Li
Departamento de Control Automático, CINVESTAV-IPN, A.P. 14-740, Av.IPN 2508, México D.F., 07360, México
Wen Yu

Authors

Jair Cervantes
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoou Li
View author publications
You can also search for this author in PubMed Google Scholar
Wen Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, 07738, Mexico City, México
Alexander Gelbukh
Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE), Luis Enrique Erro No. 1, Sta. Ma. Tonanzintla, 72840, Puebla, México
Carlos Alberto Reyes-Garcia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cervantes, J., Li, X., Yu, W. (2006). Support Vector Machine Classification Based on Fuzzy Clustering for Large Data Sets. In: Gelbukh, A., Reyes-Garcia, C.A. (eds) MICAI 2006: Advances in Artificial Intelligence. MICAI 2006. Lecture Notes in Computer Science(), vol 4293. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11925231_54

Download citation

DOI: https://doi.org/10.1007/11925231_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49026-5
Online ISBN: 978-3-540-49058-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Support Vector Machine Classification Based on Fuzzy Clustering for Large Data Sets

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A New Multi-class Fuzzy Support Vector Machine Algorithm

Support vector-based fuzzy classifier with adaptive kernel

Very large-scale data classification based on K-means clustering and multi-kernel SVM

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Support Vector Machine Classification Based on Fuzzy Clustering for Large Data Sets

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A New Multi-class Fuzzy Support Vector Machine Algorithm

Support vector-based fuzzy classifier with adaptive kernel

Very large-scale data classification based on K-means clustering and multi-kernel SVM

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation