filename : Fuc09.pdf entry : inproceedings conference : Visualization 2009 pages : 1327-1334 year : 2009 month : November-December title : Visual Human+Machine Learning subtitle : author : Raphael Fuchs and Jürgen Waser and Eduard Gröller booktitle : IEEE Transactions on Visualization and Computer Graphics (Proceedings Visualization 2009) ISSN/ISBN : editor : publisher : IEEE Computer Society Press publ.place : volume : 15 issue : 6 language : english keywords : Knowledge Discovery, Volumetric Data, Multiple Competing Hypotheses, Computer-assisted Multivariate Data Expl, Predictive Analysis, Genetic Algorithm, Curse of Dimensionality, Interactive Visual Analysis abstract : In this paper we describe a novel method to integrate interactive visual analysis and machine learning to support the insight generation of the user. The suggested approach combines the vast search and processing power of the computer with the superior reasoning and pattern recognition capabilities of the human user. An evolutionary search algorithm has been adapted to assist in the fuzzy logic formalization of hypotheses that aim at explaining features inside multivariate, volumetric data. Up to now, users solely rely on their knowledge and expertise when looking for explanatory theories. However, it often remains unclear whether the selected attribute ranges represent the real explanation for the feature of interest. Other selections hidden in the large number of data variables could potentially lead to similar features. Moreover, as simulation complexity grows, users are confronted with huge multidimensional data sets making it almost impossible to find meaningful hypotheses at all. We propose an interactive cycle of knowledge-based analysis and automatic hypothesis generation. Starting from initial hypotheses, created with linking and brushing, the user steers a heuristic search algorithm to look for alternative or related hypotheses. The results are analyzed in information visualization views that are linked to the volume rendering. Individual properties as well as global aggregates are visually presented to provide insight into the most relevant aspects of the generated hypotheses. This novel approach becomes computationally feasible due to a GPU implementation of the time-critical parts in the algorithm. A thorough evaluation of search times and noise sensitivity as well as a case study on data from the automotive domain substantiate the usefulness of the suggested approach.