razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- puri, s., a fuzzy similarity based concept...

7
1392

Upload: others

Post on 03-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

1392

Page 2: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

TF-IDF

12375830924

Page 3: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

11 2 12 3 13 4 14 4 15 4 16 5 17 5 18 5

21 8 22 8

221 8 222 9 223 TF-IDF10 224 12 225 12 226 13

23 14 231 14 232 15 233 15 234 15 235 16

24 17 241 k17 242 k17

Page 4: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

243 18 244 19

25 19 251 20 252 20

26 21

31 23 32 23 33 25 34 29

41 31 42 31 43 31 44 32

441 33 442 36 443 39

45 42 451 43 452 45 453 45

46 46

51 50 52 51

Page 5: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

59

1- Raghavan, P., Amer-Yahia, S., Gravano, L., Structure in Text: Extraction and Exploitation, Proceeding of the 7th international Workshop on the Web and Databases, ACM SIGMOD/PODS, 2004.

2- Falinouss, P., Stock trend prediction using news articles: a text mining approach,

Journal of computing, vol.18, No.1, 2007.

3- Hotho, A., Nürnberger, A., Paaß, G. A brief survey of text mining, LDV Forum-GLDV Journal for Computational Linguistics and Language Technology, Vol. 20. No. 1, pp. 19-62, 2005.

4- Khan, A., Baharum, B. B., Khan, K., An Overview of E-Documents Classification,

Proceedings of International Conference on Machine Learning and Computing, vol.3, 2009.

5- Chagheri, S., Calabretto, S., Roussey, C., Dumoulin, C., Document Classification

combining Structure and content, 13th International Conference on Entreprise Information Systems, pp. 135-148, ACM, 2011.

6- De Melo, G., Siersdorfer, S., Multilingual Text Classification using Ontologies, Advances in Information Retreval, Springer Berlin Heidelberg, pp. 541-548, 2007.

7- Pazzani, M., Muramatsu, J., Billsus, D., Syskill and Webert: identifying interesting

websites, Proc. of the 13th Amer. Nat. Conf. on Artificial Intelligence (AAAI/IAAI), vol.1, pp. 54-61, 1996.

8- Twycross, J., Cayzer, S., An immune-based approach to document classification,

Information Infrastructure Laboratory HP Laboratories Bristol, HPL-2002-292, pp. 33-46, 2002.

9- Sarawagi, S., Kirpal, A, Efficient set joins on similarity predicates, Proceedings of the

2004 ACM SIGMOD international conference on Management of data, pp. 743-754, ACM, 2004.

10- Arthur, D., Vassilvitskii, S., k-means++: The advantages of careful seeding,

Proceedings of the 18th annual ACM-SIAM symposium on Discrete algorithms, Society for Industrial and Applied Mathematics, pp. 1027-1035, 2007.

11- Onoda, T., Sakai, M., Yamada, S., Careful Seeding Method based on Independent

Components Analysis for k-means Clustering, Journal of Emerging Technologies in Web Intelligence, vol. 4.1, pp. 51-59, 2012.

12- Sayeed, A., Sarkar, S., Deng, Y., Characteristics of Document Similarity Measures for

Compliance Analysis, Proceedings of the 18th ACM conference on Information and knowledge management, pp. 1207-1216, ACM, 2009.

Page 6: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

60

13- Huang, A., Similarity measures for text document clustering, Proceedings of the 6th New Zealand Computer Science Research Student Conference, Christchurch, New Zealand, pp. 49-56, 2008.

14- Sandhya, N., Lalitha, Y. S., Govardhan, A., Anuradha, K., Analysis of similarity

measures for text clustering, International Journal of Data Engineering, vol. 2, No. 4, 2008.

15- Bang, S. L., Yang, J. D., Yang, H. J., Hierarchical document categorization with k-NN

and concept-based thesauri, Information processing & managementI, vol. 42, No. 2, pp. 387-406, 2006.

16- Bobrowski, L., Topczewska, M., Improving the K-NN Classification with the

Euclidean Distance Through Linear Data Transformations, Advances in Data Mining, Springer Berlin Heidelberg, pp. 23-32, 2004.

17- Kim, S. B., Han, K. S., Rim, H. C., Myaeng, S. H., Some Effective Techniques for

Naive Bayes Text Classification, IEEE Transactions on Knowledge and Data Engineering, vol. 18, No. 11, pp. 1457-1466, 2006.

18- Chai, K. M. A., Ng, H. T., Chieu, H. L., Bayesian Online Classifiers for Text

Classification and Filtering, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 97-104, ACM, 2002.

19- Wang, T. Y., Chiang, H. M., One-against-one fuzzy support vector machine classifier:

An approach to text categorization, Expert Systems with Applications, vol. 36, No. 6, pp. 10030-10034, 2009.

20- Chung, Y. Y., et al., A New Hybrid Audio Classification Algorithm Based on SVM

Weight Factor and Euclidean Distance, Proceedings of the 2007 annual Conference on International Conference on Computer Engineering and Applications, World Scientific and Engineering Academy and Society, pp. 152-157, 2007.

21- Pilaszy, I., Text Categorization and Support Vector Machines, Proceedings of the 6th

International Symposium of Hungarian Researchers on Computational Intelligence, pp.170-178, 2005.

22- Chim, H., Deng, X., Efficient Phrase-Based Document Similarity for Clustering, IEEE

Transactions on Knowledge and Data Engineering, vol. 20, No. 9, pp. 1217-1229, 2008.

23- Kent, C. K., Salim, N., Features Based Text Similarity Detection, Journal of

Computing, ISSN 2151-9617, vol. 2, No. 1, 2010.

24- Zhu, F., Yang, J., Zhou, Y., Enriched Format Text Categorization Using A Component Similarity Approach, Journal of Software, vol. 6, No. 9, pp. 1713-1720, 2011.

Page 7: Razmi thesis - ucna.ac.ir§براهیم_رزمی.pdf25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer

61

25- Puri, S., A Fuzzy Similarity Based Concept Mining Model for Text Classification, International Journal of Advanced Computer Science and Applications, Vol. 2, No. 11, 2011.

26- Bisson, C. G. G., Using a co-similarity approach on a large scale text categorization

task, MARAMI, 2011.

27- Lakshmi, P. S., Sushma, V., Manasa, T., Different Similarity Measures for Text Classification Using Knn, IOSR Journal of Computer Engineering , ISSN: 2278-0661, ISBN: 2278-8727 ,vol. 5, No. 6, PP. 30-36, 2012.

28- Strehl, A., Ghosh, J., Mooney, R., Impact of Similarity Measures on Web-page

Clustering, Workshop on Artificial Intelligence for Web Search (AAAI 2000), pp. 58-64, 2000.