Knowledge discovery enhanced with semantic and social information (2009)
Mladenic, Dunja, De Gemmis, Marco, Spiliopoulou, Myra, Semeraro, Giovanni, Svatek, Vojtech, ...
Using Text Mining and Link Analysis for Software Mining (2008)
Miha Grcar, Marko Grobelnik, Dunja Mladenic
Abstract. Many data mining techniques are these days in use for ontology learning – text mining, Web mining, graph mining, link analysis, relational data mining, and so on. In the current...
OntoGen Semi-automatic Ontology Editor (2008)
Blaz Fortuna, Marko Grobelnik, Dunja Mladenic
The rapid growth of documents, web pages and other types of textual content pose a great challenge to modern content management systems. Ontologies offer an efficient way to reduce the amount of...
OntoGen: Semi-automatic Ontology Editor (2008)
Blaz Fortuna, Marko Grobelnik, Dunja Mladenic
Abstract. In this paper we present a semi-automatic ontology editor as implemented in a new version of OntoGen system. The system integrates machine learning and text mining algorithms into an...
SEMANTIC GRAPHS DERIVED FROM TRIPLETS WITH APPLICATION IN DOCUMENT SUMMARIZATION (2008)
Rusu, Delia, Fortuna, Blaz, Grobelnik, Marko, Mladenic, Dunja
Information nowadays has become more and more accessible, so much as to give birth to an information overload issue. Yet important decisions have to be made, depending on the available information....
EXTENDING ONTOLOGIES FOR ANNOTATING BUSINESS NEWS (2008)
Novalija, Inna, Mladenic, Dunja
Ontologies are commonly used for annotating textual data mainly based on human language technologies [1]. This research focuses on manual extensions of ontologies to support the annotation of...
Analyzing Tag Semantics Across Collaborative Tagging Systems (2008)
Benz, Dominik, Grobelnik, Marko, Hotho, Andreas, Jaschke, Robert, Mladenic, Dunja, Servedio, Vito D. P., ...
The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, capturing and representing differences in tagging...
08391 Working Group Summary -- Analyzing Tag Semantics Across Tagging Systems (2008)
Benz, Dominik, Grobelnik, Marko, Hotho, Andreas, Jäschke, Robert, Mladenic, Dunja, Servedio, Vito D. P., ...
The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, capturing and representing differences in tagging...
Classification Tree Chaining in Data Analysis (2007)
Dunja Mladenic, Marko Grobelnik, Ray J. Paul, Ivan Bratko
Understanding of discrete event simulation systems is a difficult task. Our approach involves consultation with a domain expert, and the use of discrete event simulation model and machine learning as...
Rayid Ghani, Rosie Jones, Dunja Mladenic
query generation: finding documents matching a minority concept
Dunja Mladenic, Rayid Ghani, Rosie Jones
query generation: finding documents matching a minority concept
Rayid Ghani, Rosie Jones, Dunja Mladenic
query generation: nding documents matching a minority concept
From Web to Social Web: discovering and deploying user and content profiles (2007)
Hotho, Andreas, Mladenic, Dunja, Semeraro, Giovanni
This book constitutes the refereed proceedings of the Workshop on Web Mining, WebMine 2006, held in Berlin, Germany, September 2006. Topics included are data mining based on analysis of bloggers and...
Alphonse, Erick, Aubin, Sophie, Derivière, Julien, Hamon, Thierry, Mladenic, Dunja, Nazarenko, Adeline, ...
This deliverable describes the specifications of the Natural Language Processing line to be developed within the Work Package 5 (WP5) in the Alvis Project. The WP5 is in charge of the NLP aspects of...
Alphonse, Erick, Aubin, Sophie, Derivière, Julien, Hamon, Thierry, Mladenic, Dunja, Nazarenko, Adeline, ...
This deliverable describes the specifications of the Natural Language Processing line to be developed within the Work Package 5 (WP5) in the Alvis Project. The WP5 is in charge of the NLP aspects of...
Alphonse, Erick, Aubin, Sophie, Derivière, Julien, Hamon, Thierry, Mladenic, Dunja, Nazarenko, Adeline, ...
This deliverable describes the specifications of the Natural Language Processing line to be developed within the Work Package 5 (WP5) in the Alvis Project. The WP5 is in charge of the NLP aspects of...
Alphonse, Erick, Aubin, Sophie, Derivière, Julien, Hamon, Thierry, Mladenic, Dunja, Nazarenko, Adeline, ...
This deliverable describes the specifications of the Natural Language Processing line to be developed within the Work Package 5 (WP5) in the Alvis Project. The WP5 is in charge of the NLP aspects of...
Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis (2006)
Bast, Holger, Dupret, Georges, Majumdar, Debapriyo, Piwowarski, Benjamin, Ackermann, Markus, Berendt, Bettina, ...
We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decomposition, such as latent semantic...
A roadmap for web mining (2004)
Berendt, Bettina, Hotho, Andreas, Mladenic, Dunja, Someren, Maarten Van, Spiliopoulou, Myra, Stumme, Gerd
Auch erschienen in: Berendt, Bettina u.a. (Hrsg.): Web mining: from web to semantic web. (Lecture notes in computer science ; 3209). Berlin u.a. : Springer, 2004. S. 1-22. ISBN 3-540-23258-3 (The...
A Roadmap for Web Mining: (2004)
From Web To, Bettina Berendt, Andreas Hotho, Dunja Mladenic, Maarten Van Someren, Myra Spiliopoulou, ...
this article form specific challenges, in particular: the privacy and security of patient data, the semantics of visual material (cf. the Digital Imaging and Communications in Medicine standard:...
M.: Visualizing Very Large Graphs Using Clustering Neighborhoods (2004)
Dunja Mladenic, Marko Grobelnik
Abstract. This paper presents a method for visualization of large graphs in a two-dimensional space, such as a collection of Web pages. The main contribution here is in the representation change to...
A roadmap for web mining (2004)
Berendt, Bettina, Hotho, Andreas, Mladenic, Dunja, Someren, Maarten Van, Spiliopoulou, Myra, Stumme, Gerd
Auch erschienen in: Berendt, Bettina u.a. (Hrsg.): Web mining: from web to semantic web. (Lecture notes in computer science ; 3209). Berlin u.a. : Springer, 2004. S. 1-22. ISBN 3-540-23258-3 (The...
Rosiejones And Dunja, Rayid Ghani, Dunja Mladenic
The web is a sou rce of valu able information bu t the process of collecting, organizing and e#ectivelyu tilizing theresou rces it contains is di#cu lt. We describe Corpu Bu ilder, an approach for au...
Under consideration for publication in Knowledge and Information (2003)
Systems Building Minority, Rayid Ghani, Rosie Jones, Dunja Mladenic
The web is a source of valuable information but the process of collecting, organizing and utilizing these resources is dicult. We describe CorpusBuilder, an approach for automatically generating web...
First European Web Mining Forum Edited by (2003)
Bettina Berendt, Andreas Hotho, Dunja Mladenic, Maarten Van Someren, Myra Spiliopoulou, Gerd Stumme, ...
ECML/PKDD-2003: 1st European Web Mining Forum
Building minority language corpora by learning to generate web search queries (2001)
Rayid Ghani, Rosie Jones, Dunja Mladenic
The Web is an obvious source of valuable information but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approach for automatically...
Building minority language corpora by learning to generate web search queries (2001)
Rayid Ghani, Rosie Jones, Dunja Mladenic
The Web is an obvious source of valuable information but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approach for automatically...
Mining the web to create minority language corpora (2001)
Rayid Ghani, Rosie Jones, Dunja Mladenic
The Web is a valuable source of language specific resources but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approach for...
Machine learning for better Web browsing (2000)
This paper describes a way to help the user browsing the Web using Machine Learning techniques. Two machine learning algorithms, Naive Bayes and k-Nearest Neighbor, are compared on two approaches...
Data Mining on Symbolic Knowledge Extracted from the Web (2000)
Rayid Ghani, Rosie Jones, Dunja Mladenic, Kamal Nigam, Sean Slattery
Information extractors and classifiers operating on unrestricted, unstructured texts are an errorful source of large amounts of potentially useful information, especially when combined with a crawler...
Data Mining on Symbolic Knowledge Extracted from the Web (2000)
Rayid Ghani, Rosie Jones, Dunja Mladenic, Kamal Nigam, Sean Slattery
Information extractors and classifiers operating on unrestricted unstructured texts are an errorful source of large amounts of potentially useful information, especially when combined with a crawler...
Feature selection for unbalanced class distribution and Naive Bayes (1999)
Dunja Mladenic, Marko Grobelnik
This paper describes an approach to feature subset selection that takes into account problem specifics and learning algorithm characteristics. It is developed for the Naive Bayesian classifier...
Predicting Content From Hyperlinks (1999)
Dunja Mladenic, Marko Grobelnik
This paper describes an approach to prediction of a document content based on the hyperlink that points to the document. The k-Nearest Neighbor algorithm is used to predict a set of words that appear...
Assigning Keywords to Documents Using Machine Learning (1999)
Dunja Mladenic, Marko Grobelnik
This paper describes the usage of machine learning techniques to assign keywords to documents. The large hierarchy of documents available on the Web, Yahoo hierarchy, is used here as a real-world...
Predicting Content from Hyperlinks (1999)
Dunja Mladenic, Marko Grobelnik
This paper describes an approach to prediction of a document content based on the hyperlink that points to the document. The k-Nearest Neighbor algorithm is used to predict a set of words that appear...
Text-Learning and Related Intelligent Agents (1999)
Analysis of text data using intelligent information retrieval, machine learning, natural language processing or other related methods is becoming an important issue for the development of intelligent...
Machine learning used by Personal WebWatcher (1999)
This paper describes design of personal browsing assistant Personal WebWatcher that suggests interesting hyperlinks on the requested Web documents. Machine learning is used to generate a model of...
Text-learning and intelligent agents (1998)
We present an overview of some work in text-learning through the prism of the three research questions important for development of textlearning intelligent agents: what representation is used for...
Text-Learning and Intelligent Agents (1998)
We present an overview of some work in text-learning through the prism of the three research questions important for development of textlearning intelligent agents: what representation is used for...
Strojno Ucenje Na Nehomogenih, Distribuiranih Tekstovnih Podatkih (1998)
Dunja Mladenic, Univerza V Ljubljani, Komentor Tom, Advisors Prof, Ivan Bratko, Prof Tom, ...
This dissertation proposes new machine learning methods where the corresponding learning problem is characterized by a high number of features, unbalanced class distribution and asymmetric...
Efficient Text Categorization (1998)
Marko Grobelnik, Dunja Mladenic
We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...
Word Sequences as Features in Text-Learning (1998)
Dunja Mladenic, Marko Grobelnik
This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...
References for: Machine learning on non-homogeneous, distributed text data (1998)
a, J., Huang, J., & Vafaie, H. (1995). Hybrid Learning Using Genetis Algorithms and Decision Trees for Pattern Classification. Proceedings of the 14th International Joint Conference on Artificial...
Turning Yahoo into an Automatic Web-Page Classifier (1998)
. The paper describes an approach to automatic Web-page classification based on the Yahoo hierarchy. Machine learning techniques developed for learning on text data are used here on the hierarchical...
Personal WebWatcher: design and implementation (1996)
Introduction With the growing availability of information sources, especially non-homogeneous, distributed sources like the World Wide Web, there is also a growing interest in tools that can help in...
Knowledge Acquisition for Discrete Event Systems Using Machine Learning (1994)
Dunja Mladenic, Ivan Bratko, Ray J. Paul, Marko Grobelnik
Knowledge acquisition for an understanding of discrete event simulation systems is a difficult task. Machine Learning has been investigated to help in the knowledge acquisition process. Our approach...