Contextualizing Ontologies with OntoLight: A Pragmatic Approach (2008)
Marko Grobelnik, Janez Brank, Blaž Fortuna, Igor Mozetič
We present a pragmatic approach to using large-scale ontologies as contexts. The approach is based on a lightweight ontology model and grounding of the ontology concepts in textual documents. These...
Contextualizing ontologies with ontolight : a pragmatic approach (2008)
Grobelnik, Marko, Brank, Janez, Fortuna, Blaz, Mozetic, Igor
Contextualizing ontologies with ontolight : a pragmatic approach
Feature selection for the classification of large document collections (2008)
Brank, Janez, Mladenić, Dunja, Grobelnik, Marko, Milic-Frayling, Natasa
Feature selection methods are often applied in the context of document classification. They are particularly important for processing large data sets that may contain millions of documents and are...
Automatic evaluation of ontologies (2007)
Brank, Janez, Grobelnik, Marko, Mladenić, Dunja
Automatic evaluation of ontologies
PREDICTING THE ADDITION OF NEW CONCEPTS IN A TOPIC HIERARCHY (2007)
Brank, Janez, Mladenić, Dunja, Grobelnik, Marko
Ontologies often change through time, a process largely done manually by human editors. We discuss the task of automatically predicting when structural changes will occur in a given ontology. We...
Hierarchical text categorization using coding matrices (2006)
Brank, Janez, Mladenić, Dunja, Grobelnik, Marko
We discuss the task of ontology population as a machine learning problem with a large hierarchy of classes. Since many machine learning methods are designed primarily for two-class problems, it is...
Loose Phrase String Kernels (2006)
When representing textual documents by feature vectors for the purposes of further processing (e.g. for categorization, clustering, or visualization), one possible representation is based on “loose...
Using DMoz for constructing ontology from data stream (2006)
Grobelnik, Marko, Brank, Janez, Mladenić, Dunja, Novak, Blaz, Fortuna, Blaz
This paper presents an approach for constructing an ontology from a stream of documents. Named entities extracted from the documents are used as instances of the ontology. Entities and co-occurring...
M.: Gold standard based ontology evaluation using instance assignment (2006)
An ontology is an explicit formal conceptualization of some domain of interest. Ontology evaluation is the problem of assessing a given ontology from the point of view of a particular criterion or...
A survey of ontology evaluation techniques (2005)
Brank, Janez, Grobelnik, Marko, Mladenić, Dunja
An ontology is an explicit formal conceptualization of some domain of interest. Ontologies are increasingly used in various fields such as knowledge management, information extraction, and the...
A survey of ontology evaluation techniques (2005)
Janez Brank, Marko Grobelnik, Dunja Mladenić
An ontology is an explicit formal conceptualization of some domain of interest. Ontologies are increasingly used in various fields such as knowledge management, information extraction, and the...
MACHINE LEARNING ON SETS OF DOCUMENTS CONNECTED IN GRAPHS (2004)
This paper deals with the problem of machine learning on sets of documents connected into graphs. Our strategy is to represent each document by a diverse set of heterogeneous attributes, including...
DRAWING GRAPHS USING SIMULATED ANNEALING AND GRADIENT DESCENT (2004)
This paper presents graph drawing as an optimization problem. Each vertex of the graph is to be represented by a point in the plane, and each edge by a straight line between two points. To evaluate a...
Feature selection using linear classifier weights: interaction with classification models (2004)
This paper explores feature scoring and selection based on weights from linear classification models. It investigates how these methods combine with various learning models. Our comparative analysis...
The Download Estimation task on KDD Cup 2003 (2003)
This paper describes our work on the Download Estimation task for KDD Cup 2003. The task requires us to estimate how many times a paper has been downloaded in the first 60 days after it has been...
The Download Estimation Task on KDD Cup 2003 (2003)
Janez Brank And, Janez Brank, Jure Leskovec
This paper describes our work on the Download Estimation task for KDD Cup 2003. The task requires us to estimate how many times a paper has been downloaded in the first 60 days after it has been...
Interaction of feature selection methods and linear classification models (2002)
In this paper we explore effects of various feature selection algorithms on document classification performance. We propose to use two, possibly distinct linear classifiers: one used exclusively for...