Adriaens, Florian Aslay, Cigdem De Bie, Tijl Gionis, Aristides Lijffijt, Jefrey
Cycles in graphs often signify interesting processes. For example, cyclic trading patterns can indicate inefficiencies or economic dependencies in trade networks, cycles in food webs can identify fragile dependencies in ecosystems, and cycles in financial transaction networks can be an indication of money laundering. Identifying such interesting cy...
Bendimerad, Anes Lijffijt, Jefrey Plantevit, Marc Robardet, Celine De Bie, Tijl
Concepts are often described in terms of positive integer-valued attributes that are organized in a hierarchy. For example, cities can be described in terms of how many places there are of various types (e.g. nightlife spots, residences, food venues), and these places are organized in a hierarchy (e.g. a Portuguese restaurant is a type of food venu...
Adriaens, Florian Lijffijt, Jefrey De Bie, Tijl
Consider a large graph or network, and a user-provided set of query vertices between which the user wishes to explore relations. For example, a researcher may want to connect research papers in a citation network, an analyst may wish to connect organized crime suspects in a communication network, or an internet user may want to organize their bookm...
Bendimerad, Anes Mel, Ahmad Lijffijt, Jefrey Plantevit, Marc Robardet, Céline De Bie, Tijl
Community detection in graphs, data clustering, and local pattern mining are three mature fields of data mining and machine learning. In recent years, attributed subgraph mining is emerging as a new powerful data mining task in the intersection of these areas. Given a graph and a set of attributes for each vertex, attributed subgraph mining aims to...
Bendimerad, Anes Mel, Ahmad Lijffijt, Jefrey Plantevit, Marc Robardet, Céline De Bie, Tijl
Community detection in graphs, data clustering, and local pattern mining are three mature fields of data mining and machine learning. In recent years, attributed subgraph mining is emerging as a new powerful data mining task in the intersection of these areas. Given a graph and a set of attributes for each vertex, attributed subgraph mining aims to...
Deng, Junning Lijffijt, Jefrey Kang, Bo De Bie, Tijl
This paper introduces an approach to find motifs in time series that are \emph{subjectively interesting}. That is, the aim is to find motifs that are surprising given an informative background distribution, which may for example correspond to the prior knowledge of a user of the tool. We quantify this surprisal using information theory, and more pa...
Kang, Bo Puolamäki, Kai Lijffijt, Jefrey De Bie, Tijl
We present SIDE, a tool for Subjective and Interactive Visual Data Exploration, which lets users explore high dimensional data via subjectively informative 2D data visualizations. Many existing visual analytics tools are either restricted to specific problems and domains or they aim to find visualizations that align with user’s belief about the dat...
Puolamäki, Kai Kang, Bo Lijffijt, Jefrey De Bie, Tijl
Data visualization and iterative/interactive data mining are growing rapidly in attention, both in research as well as in industry. However, integrated methods and tools that combine advanced visualization and data mining techniques are rare, and those that exist are often specialized to a single problem or domain. In this paper, we introduce a nov...
Kang, Bo Lijffijt, Jefrey Santos-Rodríguez, Raúl De Bie, Tijl
Methods that find insightful low-dimensional projections are essential to effectively explore high-dimensional data. Principal Component Analysis is used pervasively to find low-dimensional projections, not only because it is straightforward to use, but it is also often effective, because the variance in data is often dominated by relevant structur...