Determination of Variables for a Bayesian Network and the Most Precious One

Cinicioglu E. N. , Yenilmez T.

16th International Conference on Information Processing and Management of Uncertainty in Knowledge-based Systems (IPMU), Eindhoven, Netherlands, 20 - 24 June 2016, vol.610, pp.313-325 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 610
  • Doi Number: 10.1007/978-3-319-40596-4_27
  • City: Eindhoven
  • Country: Netherlands
  • Page Numbers: pp.313-325
  • Keywords: Bayesian networks, Variable selection in Bayesian networks, Importance hierarchy of variables in network, Variable evaluation scores, Limited data sets


To ensure the quality of a learned Bayesian network out of limited data sets, evaluation and selection process of variables becomes necessary. With this purpose, two new variable selection criteria N2Sj and N3Sj are proposed in this research which show superior performance on limited data sets. These newly developed variable selection criteria with the existing ones from prior research are employed to create Bayesian networks from three different limited data sets. On each step of variable elimination, the performance of the resulting BNs are evaluated in terms of different network performance metrics. Furthermore, a new variable evaluation criteria, IHj, is proposed which measures the impact of a variable to all the other variables in the network. IHj serves as an indicator of the most important variables in the network which has a special importance for the use of BNs in social science research, where it is crucial to identify the most important factors in a setting.