Razmi thesis - ucna.ac.ir
1392
1- Raghavan, P., Amer-Yahia, S., Gravano, L., Structure in Text: Extraction and Exploitation, Proceedings of the 7th International Workshop on the Web and Databases, ACM SIGMOD/PODS, 2004.
2- Falinouss, P., Stock trend prediction using news articles: a text mining approach, Journal of Computing, vol. 18, No. 1, 2007.
3- Hotho, A., Nürnberger, A., Paaß, G., A brief survey of text mining, LDV Forum-GLDV Journal for Computational Linguistics and Language Technology, vol. 20, No. 1, pp. 19-62, 2005.
4- Khan, A., Baharum, B. B., Khan, K., An Overview of E-Documents Classification, Proceedings of International Conference on Machine Learning and Computing, vol. 3, 2009.
5- Chagheri, S., Calabretto, S., Roussey, C., Dumoulin, C., Document Classification Combining Structure and Content, 13th International Conference on Enterprise Information Systems, pp. 135-148, ACM, 2011.
6- De Melo, G., Siersdorfer, S., Multilingual Text Classification using Ontologies, Advances in Information Retrieval, Springer Berlin Heidelberg, pp. 541-548, 2007.
7- Pazzani, M., Muramatsu, J., Billsus, D., Syskill and Webert: identifying interesting websites, Proc. of the 13th Amer. Nat. Conf. on Artificial Intelligence (AAAI/IAAI), vol. 1, pp. 54-61, 1996.
8- Twycross, J., Cayzer, S., An immune-based approach to document classification, Information Infrastructure Laboratory HP Laboratories Bristol, HPL-2002-292, pp. 33-46, 2002.
9- Sarawagi, S., Kirpal, A., Efficient set joins on similarity predicates, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, pp. 743-754, ACM, 2004.
10- Arthur, D., Vassilvitskii, S., k-means++: The advantages of careful seeding, Proceedings of the 18th annual ACM-SIAM symposium on Discrete algorithms, Society for Industrial and Applied Mathematics, pp. 1027-1035, 2007.
11- Onoda, T., Sakai, M., Yamada, S., Careful Seeding Method based on Independent Components Analysis for k-means Clustering, Journal of Emerging Technologies in Web Intelligence, vol. 4, No. 1, pp. 51-59, 2012.
12- Sayeed, A., Sarkar, S., Deng, Y., Characteristics of Document Similarity Measures for Compliance Analysis, Proceedings of the 18th ACM conference on Information and knowledge management, pp. 1207-1216, ACM, 2009.
13- Huang, A., Similarity measures for text document clustering, Proceedings of the 6th New Zealand Computer Science Research Student Conference, Christchurch, New Zealand, pp. 49-56, 2008.
14- Sandhya, N., Lalitha, Y. S., Govardhan, A., Anuradha, K., Analysis of similarity measures for text clustering, International Journal of Data Engineering, vol. 2, No. 4, 2008.
15- Bang, S. L., Yang, J. D., Yang, H. J., Hierarchical document categorization with k-NN and concept-based thesauri, Information Processing & Management, vol. 42, No. 2, pp. 387-406, 2006.
16- Bobrowski, L., Topczewska, M., Improving the K-NN Classification with the Euclidean Distance Through Linear Data Transformations, Advances in Data Mining, Springer Berlin Heidelberg, pp. 23-32, 2004.
17- Kim, S. B., Han, K. S., Rim, H. C., Myaeng, S. H., Some Effective Techniques for Naive Bayes Text Classification, IEEE Transactions on Knowledge and Data Engineering, vol. 18, No. 11, pp. 1457-1466, 2006.
18- Chai, K. M. A., Ng, H. T., Chieu, H. L., Bayesian Online Classifiers for Text Classification and Filtering, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 97-104, ACM, 2002.
19- Wang, T. Y., Chiang, H. M., One-against-one fuzzy support vector machine classifier: An approach to text categorization, Expert Systems with Applications, vol. 36, No. 6, pp. 10030-10034, 2009.
20- Chung, Y. Y., et al., A New Hybrid Audio Classification Algorithm Based on SVM Weight Factor and Euclidean Distance, Proceedings of the 2007 annual Conference on International Conference on Computer Engineering and Applications, World Scientific and Engineering Academy and Society, pp. 152-157, 2007.
21- Pilaszy, I., Text Categorization and Support Vector Machines, Proceedings of the 6th International Symposium of Hungarian Researchers on Computational Intelligence, pp. 170-178, 2005.
22- Chim, H., Deng, X., Efficient Phrase-Based Document Similarity for Clustering, IEEE Transactions on Knowledge and Data Engineering, vol. 20, No. 9, pp. 1217-1229, 2008.
23- Kent, C. K., Salim, N., Features Based Text Similarity Detection, Journal of Computing, ISSN 2151-9617, vol. 2, No. 1, 2010.
24- Zhu, F., Yang, J., Zhou, Y., Enriched Format Text Categorization Using A Component Similarity Approach, Journal of Software, vol. 6, No. 9, pp. 1713-1720, 2011.
25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer Science and Applications, vol. 2, No. 11, 2011.
26- Bisson, C. G. G., Using a co-similarity approach on a large scale text categorization task, MARAMI, 2011.
27- Lakshmi, P. S., Sushma, V., Manasa, T., Different Similarity Measures for Text Classification Using Knn, IOSR Journal of Computer Engineering, ISSN: 2278-0661, ISBN: 2278-8727, vol. 5, No. 6, pp. 30-36, 2012.
28- Strehl, A., Ghosh, J., Mooney, R., Impact of Similarity Measures on Web-page Clustering, Workshop on Artificial Intelligence for Web Search (AAAI 2000), pp. 58-64, 2000.