Academia.eduAcademia.edu

Web Intelligence

description3,332 papers
group2,373 followers
lightbulbAbout this topic
Web Intelligence refers to the study and application of techniques for gathering, analyzing, and interpreting data from the web to enhance decision-making processes. It encompasses areas such as data mining, natural language processing, and machine learning to extract meaningful insights from vast amounts of online information.
lightbulbAbout this topic
Web Intelligence refers to the study and application of techniques for gathering, analyzing, and interpreting data from the web to enhance decision-making processes. It encompasses areas such as data mining, natural language processing, and machine learning to extract meaningful insights from vast amounts of online information.

Key research themes

1. How can data mining and AI techniques be effectively applied to extract and utilize web data across content, structure, and usage dimensions?

Research in this area explores the application of data mining methodologies to diverse categories of web data — including web content, link structures, and user interactions — to extract meaningful knowledge that enhances information retrieval, personalization, and web service optimization. This focus is critical as the volume and heterogeneity of web data grow exponentially, requiring tailored approaches that address unique characteristics of web-based information unlike traditional data mining.

Key finding: This foundational paper provides a comprehensive taxonomy of web mining by categorizing it into three distinct types — web content mining, web structure mining, and web usage mining — each focusing on different web data types... Read more
Key finding: The study advances web intelligence by integrating soft computing techniques such as fuzzy logic, neural networks, genetic algorithms, and support vector machines with web mining. It demonstrates about the efficacy of these... Read more
Key finding: This paper introduces an adaptive approach using ART (Adaptive Resonance Theory) neural networks for analyzing web usage logs. It demonstrates that neural network models can effectively process large-scale, heterogeneous, and... Read more
Key finding: This work studies the role of web crawlers as integral components of web mining systems, emphasizing how crawling frameworks are adapted for mining related and interacted web pages. It discusses the integration of crawler... Read more

2. What role do semantic web technologies and knowledge representation methods play in advancing machine understanding and automated information retrieval on the web?

This research theme focuses on how semantic web frameworks, ontologies, and formal knowledge representation languages facilitate more precise, interoperable, and machine-readable descriptions of web data. It addresses challenges in encoding semantics beyond syntactic markup to allow reasoning and advanced querying, which are key to intelligent web applications capable of context-aware search, knowledge retrieval, and enhanced interoperability between heterogeneous data sources.

Key finding: This paper presents WebKB, a web-accessible tool that uses conceptual graphs (CGs) for representing semantic statements embedded in web documents. It argues that creating knowledge-based metadata languages requires balancing... Read more
Key finding: This study explores the use of semantic web standards and ontology-based metadata to create richer, interactive, and user-adaptive media content experiences. The research prototypes an application that employs semantic... Read more
Key finding: MoRAI proposes a peer-to-peer architecture to enable RDF data exchange and semantic overlay networking under conditions of intermittent internet connectivity. The system organizes peers geographically and semantically to... Read more

3. How can machine learning and AI improve the efficacy of web-based tools for user-centered applications such as search engine optimization, privacy policy analysis, and reputation systems?

This area investigates the integration of AI and machine learning techniques in web applications that directly impact user experiences and trust. Research here deals with the automatic identification and structuring of web content for SEO, leveraging deep learning for reputation and opinion mining, and applying machine learning for improving privacy policy summarization and decision support. These efforts aim to bridge gaps between vast, unstructured web data and actionable, user-friendly knowledge.

Key finding: This paper proposes an autonomous approach to identify entities from short, unstructured text fragments on web pages to populate ontologies for semantic web content structuring. The approach targets reducing the human... Read more
Key finding: This study introduces a reputation system combining web mining techniques and deep learning applied to Covid-19 related Twitter data. It demonstrates that deep neural networks can automatically classify and analyze user... Read more
Key finding: PrivacyCheck employs machine learning to automatically summarize complex online privacy policies for end-users and includes a competitor analysis component to recommend services with better privacy. Usage data analysis... Read more

All papers in Web Intelligence

As web sites proliferate, offering more of the same, why does a customer choose one web site over the others? Among other factors, website intelligence offers a viable answer to this question. However, past studies fall short of providing... more
Straipsnyje aptariamas komandomis grįsto mokymo(si) strategijos elementų pritaikymas virtualiai aplinkai, taikant Web 5.0 saityno technologijas. Komandomis grįsto mokymo(si) strategija pasirinkta dėl jos efektyvumo aukštajame moksle. Web... more
This article proposes a system architecture of a case based web services reasoner, CWSR, looks at the correspondence relationship between the main activities of CBR and those of web services, and proposes a unified approach for case-based... more
Introduction: In the last couple of decades, Amharic-English translation has greatly improved from a rule-based approach to contemporary systems that apply neural networks. Even after these advancements, problems remain because of the... more
Information marketplaces are places where users search and retrieve information goods. Intelligent Agents could represent the participating entities in such places, i.e, assume the role of buyers and sellers of information products. In... more
This paper describes iShakti, a real-world, Intelligent, Interactive and Adaptive Web application. At present, iShakti is deployed across 1000 rural kiosks in India, covering 5000 villages and reaching 1 million people. Further scale up... more
Domain ontology can help in information retrieval from documents. But ontology is a pre-defined structure with crisp concept descriptions and inter-concept relations. However, due to the dynamic nature of the document repository, ontology... more
In this paper we describe an information extraction and text mining system which identifies key information components from text documents. The information components are centered on domain entities and their relationships. The components... more
In this paper we have proposed a system that performs both ontology-based text information extraction and ontology update using the extracted information. The system employs text-mining techniques to mine information from text documents... more
Recommender systems firstly appeared few years ago back in the early 1990's and tend to be on very high demand. Nowadays, they mostly can be found on every website, because every website has something, a similar item, to recommend. Since... more
On the market there are many commercial web classification services and a few publicly available web directory services. Unfortunately they mostly focus on English-speaking web sites, making them unsuitable for other languages in terms of... more
Methodology, in intelligence, consists of the methods used to make decisions about threats, especially in the intelligence analysis discipline. The enormous amount of information collected by intelligence agencies often puts them in the... more
Systems for Ambient Intelligence environments demand at some stage a service composition task, as a mean of adaptability to the context changes. However, in contrast to what ubiquitous and pervasive computing propose, users generally find... more
Recent advances in the standardization of knowledge representation languages to realize the Semantic Web as well as advances in natural language processing techniques have resulted in increased commercial efforts to create so called... more
The paper describes how interpretations of multimedia documents can be formally derived using abduction over domain knowledge represented in an ontology. The approach uses an expressive ontology specification language, namely description... more
The paper describes how interpretations of multimedia documents can be formally derived using abduction over domain knowledge represented in an ontology. The approach uses an expressive ontology specification language, namely description... more
Recent advances in the standardization of knowledge representation languages to realize the Semantic Web as well as advances in natural language processing techniques have resulted in increased commercial efforts to create so called... more
The automatic detection of emotions in Twitter posts is a challenging task due to the informal nature of the language used in this platform. In this paper, we propose a methodology for expanding the NRC word-emotion association lexicon... more
Context. Asynchronous messaging is increasingly used to support human–machine interactions, generally implemented through chatbots. Such virtual entities assist the users in activities of different kinds (e.g., work, leisure, and... more
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal... more
Language systems have been of great interest to the research community and have recently reached the mass market through various assistant platforms on the web. Reinforcement Learning methods that optimize dialogue policies have seen... more
The Information on web resources is increasing day by day because of increment of user’s interaction and contribution in web resources. It is becoming a challenge to collect relevant and useful information from this huge amount of data... more
Asynchronous messaging is leading human-machine interaction due to the boom of mobile devices and social networks. The recent release of dedicated APIs from messaging platforms boosted the development of computer programs able to conduct... more
We present two secure real-time video conferencing solutions with JPEG coding using a cipher engine based on the Blowfish algorithm. One of these solutions is developed from Java media framework (JMF). In this case, our solution consists... more
With the recent growth of the Social Web, an emerging challenge is how we can integrate information from the heterogeneity of current Social Web sites to improve semantic access to the information and knowledge across the entire World... more
Product reviews written by on-line shoppers is a valuable source of information for potential new customers who desire to make an informed purchase decision. Manually processing quite a few dozens, or even hundreds, of reviews for a... more
We use Conceptual Graphs (CGs) to model web content extraction rules (CG-Wrappers). The approach presented incorporates all major existing extraction techniques and allows the definition of synergies of cooperative wrappers for handling... more
Abstract. Modern successful on-line shops and product compari-son sites allow consumers to express their opinion on products and services they purchased. Although such information can be useful to other potential customers, reading and... more
In this paper, some auction systems are applied to the parking reservation system which would exert an important role in the next generation traffic systems. As an introduction evaluation of the auction systems for the parking... more
With the recent growth of the Social Web, an emerging challenge is how we can integrate information from the heterogeneity of current Social Web sites to improve semantic access to the information and knowledge across the entire World... more
Advent of technologies like semantic web, multi-agent systems, web mining has changed the internet as knowledge provider. Web personalization offers a solution to the information overload problem in current web by providing users a... more
Context-aware computing is widely accepted as a promising paradigm to enable seamless computing. Several middlewares and ontology-based models for describing context information have been developed in order to support contextaware... more
Interception of a data stream is central to any intelligent and dynamic processing of web information. It is perhaps as fundamental to Internet services' overall architecture as the design of disk scheduling to the conventional machine... more
Download research papers for free!