IJRCS – Volume 5 Issue 4 Paper 3

AVOIDANCE FOR DIRECT AND INDIRECT DISCRIMINATION IN DATA MINING

Author’s Name : S Saravanan | S Joseph Gabriel

Volume 05 Issue 04  Year 2018  ISSN No:  2349-3828  Page no: 7 – 9

12

Abstract:

Data mining helps to extract useful and expected information among the huge amount of collective data present in database. Automated data collection with data mining collectively performs automated decisions. Discrimination can be direct or indirect. Direct discrimination use sensitive data for decision making. Indirect discrimination makes decisions on the basis of non-sensitive data. For more accuracy they express the relationship between discrimination prevention and privacy preservation in data mining. Along with security and privacy, proper discrimination performs vital role in considering legal as well as ethical point of view of data mining. The main aim behind this paper is to develop new preprocessing discrimination prevention methodology which consists of different types of data transformation methods. With the help of that direct discrimination, indirect discrimination or both of them at the same time get prevented. For making the final decision there are two steps in which first step include identification of categories and makes groups of individuals whatever it may be, directly indirectly discriminated for making decision. In second step with the help of clustering, transformation of data in specific way such that removes all discrimination.

Keywords:

Direct Discrimination , Indirect Discrimination, Clustering etc.,

References:

  1. R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases,” Proc. 20th Int’l Conf. Very Large Data Bases, pp. 487-499, 1994.
  2. T. Calders and S. Verwer, “Three Naive Bayes Approaches for Discrimination-Free Classification,” Data Mining and Knowledge Discovery, vol. 21, no. 2, pp. 277-292, 2010.
  3. European Commission, “EU Directive 2004/113/EC on Anti- Discrimination,” http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2004:373:0037: 0043:EN:PDF, 2004.
  4. European Commission, “EU Directive 2006/54/EC on Anti- Discrimination,” http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2006:204:0023: 0036:en:PDF, 2006.
  5. S. Hajian, J. Domingo-Ferrer, and A. Marti´nez-Balleste´, “Discrimination Prevention in Data Mining for Intrusion and Crime Detection,” Proc. IEEE Symp. Computational Intelligence in Cyber Security (CICS ’11), pp. 47-54, 2011.
  6. S. Hajian, J. Domingo-Ferrer, and A. Marti´nez-Balleste´, “Rule Protection for Indirect Discrimination Prevention in Data Mining,” Proc. Eighth Int’l Conf. Modeling Decisions for Artificial Intelligence (MDAI ’11), pp. 211-222, 2011.
  7. F. Kamiran and T. Calders, “Classification without Discrimination,” Proc. IEEE Second Int’l Conf. Computer, Control and Comm. (IC4 ’09), 2009.
  8. F. Kamiran and T. Calders, “Classification with no Discrimination by Preferential Sampling,” Proc. 19th Machine Learning Conf. Belgium and The Netherlands, 2010.
  9. F. Kamiran, T. Calders, and M. Pechenizkiy, “Discrimination Aware Decision Tree Learning,” Proc. IEEE Int’l Conf. Data Mining (ICDM ’10), pp. 869-874, 2010.
  10. S . I. Chowdhury.: Statistical Expert Systems – A Special Application areafor Knowledge-based Computer Methodology. Linkoping Studies in Scienceand Technology, Thesis No- 104.,Department of Computer and Information Science, University of Linkoping, Sweden.
  11. H. Kordylewski and D. Graupe, “Applications of the LAMSTAR neural network to medical and engineering diagnosis/fault detection,” in Proc7th Artificial Neural Networks in Eng. Conf., St. Louis, MO, 1997.
  12. G.Z. Wu, “The application of data mining for medical database”, Master Thesis of Department of Biomedical Engineering, Chung Yuan University, Taiwan, Chung Li, 2000.
  13. D. Akoumianakis, N. Vidakis, G. Vellis, D. Kotsalis, G. Milolidakis, Plemenos, A. Akrivos and D. Stefanakis, Transformable Boundary Artifacts for Knowledge-based Work in Cross- organization Virtual Communities Spaces, Journal of Intelligent Decision Technologies Vol. 5 (1), 2011, in press.
  14. M. Berlingerio, F. B. F. Giannotti, and F. Turini, “Mining clinical data with a temporal dimension: A case study,” in Proc. IEEE Int. Conf. Bioinf Biomed., Nov. 2–4, 2007, pp. 429–436.
  15. Kokol P, Povalej, P., Lenic, M, Štiglic, G.: Building classifier cellular automata. 6th international conference on cellular automata for research and industry, ACRI 2004, Amsterdam, The Netherlands, October 25-27, 2004. (Lecture notes in computer science, 3305). Berlin: Springer, 2004, pp. 823-830.
  16. L. Li, L. Jing, and D. Huang, “Protein-protein interaction extraction from biomedical literature based on modified SVM-KNN,” in Nat. Lang. Process. Know. Engineer., 2009, pp. 1–7.
  17. R. Carvalho, R. Isola, and A. Tripathy, “MediQuery—An automated decision support system,” in Proc. 24th Int. Symp. Comput.-Based Med Syst.,Jun. 27–30, 2011, pp. 1–6.