Academia.edu

Human Language Technology

1,102 papers
3,891 followers
About this topic
Human Language Technology (HLT) refers to the interdisciplinary field that focuses on the development of computational methods and tools for processing, analyzing, and generating human language. It encompasses areas such as natural language processing, speech recognition, and machine translation, aiming to enhance human-computer interaction through effective language understanding and generation.

Key research themes

1. How does technology enhance second language speaking skills and oral communication?

This research theme investigates the integration of digital technologies and innovative tools specifically aimed at the improvement of speaking skills in second language learning. The focus is on identifying which technological interventions effectively promote oral proficiency, interactive communication, and fluency, examining methodologies that move beyond traditional drill-based teaching towards communicative and learner-centered approaches.

Key finding: The paper finds that modern technological tools, such as internet-based resources, podcasts, video conferencing, and speech recognition software, are beneficial for enhancing speaking skills in English language learners by...
Key finding: This extensive review identifies that technologies supporting pronunciation training and chat-based tools have provided strong evidence of positive effects on speaking skill acquisition, highlighting that...
Key finding: This study reports that innovative technologies integrated within communicative language teaching (including multimedia, internet resources, and interactive digital materials) facilitate the development of communicative...

2. What are the challenges and solutions for replicability and reproducibility in Human Language Technology (HLT) research?

This theme addresses the methodological rigor and transparency in HLT, focusing on reproducibility (recalculation of results using original data and methods) and replicability (independent re-implementation and validation). It explores the importance of resource availability, experimental documentation, and infrastructure to support open science practices that ensure credible and extensible research outcomes.

Key finding: The paper emphasizes that replicability and reproducibility are critical yet under-supported within HLT, advocating for open access to code, datasets, parameters, and procedural details. The introduction of a dedicated...

3. How are linguistic corpora technologies employed as didactic tools in translator education?

Research in this area investigates the role of corpus linguistics technologies in training future translators, aiming to equip students with automated tools for large-scale text processing, frequency analysis, and linguistic data management that support theoretical research and practical translation competencies underpinned by standardization and technical skill acquisition.

Key finding: This research demonstrates that integrating corpora technology in translator training fulfills multiple educational functions (didactic, cognitive, testing) and aligns with international translation standards by promoting...

4. What is the current state and future outlook of technology-enhanced language learning in terms of theory, practice, and teacher attitudes?

This theme synthesizes contemporary insights on the nexus between second language acquisition theory and technology-infused pedagogical practices. It also examines teacher acceptance, instructional design principles, and implementation challenges, providing a comprehensive evaluation of technology's role across skill domains and professional development.

Key finding: The book review highlights a structured synthesis linking SLA theories with technology integration, underscoring impacts on interactional competence, literacy, and domain-specific language learning. It notes teacher attitudes...
Key finding: This conceptual analysis reveals that mobile-assisted and computer-mediated technologies are increasingly vital for vocabulary, pronunciation, and reading skills development, while emphasizing that motivation and meaningful...
Key finding: The book review documents systematic ICT integration within pre-service teacher education curricula that promotes 21st-century skills through inquiry-, research-, problem-, and project-based learning models, combined with...
Key finding: This empirical study assesses ChatGPT's use for Maltese language learning, identifying that while ChatGPT is accessible and facilitates conversational practice, its current limitations in understanding and responding...

5. How is human language conceptualized and operationalized within human-computer interaction (HCI) and computer-mediated communication (CMC)?

This theme explores the complex role that natural human language plays in interaction design, interface localization, and cross-cultural communication within HCI contexts. It differentiates human languages from computer languages and discusses implications for multilingualism, user preferences, and semiotic engineering in digital communication systems.

Key finding: The paper provides a comprehensive theoretical framework that reveals human language as a multifaceted semiotic system central to all communication in HCI, advocating for a shift from mere interface localization towards...

All papers in Human Language Technology

This document describes the Part-of-Speech (POS) tagging guidelines for the Penn Chinese Treebank Project. The goal of the project is the creation of a 100-thousand-word corpus of Mandarin Chinese text with syntactic bracketing. The...
Computational, descriptive, and theoretical linguistics use both phrase structure (PS) and dependency structure (DS) to represent syntax. We believe that the next-generation treebank should be multi-representational, designed for both...
Cuneiform documents, the earliest known form of writing, are prolific textual sources of the ancient past. Experts publish editions of these texts in transliteration using specialized typesetting, but most remain inaccessible for...
This study examines the development of research on German prosody, based on a database of 960 sources. First, it highlights the main research focus areas and methodological approaches, including models of prosody, corpora, and annotation...
The key problem to be faced when building a HMM-based continuous speech recogniser is maintaining the balance between model complexity and available training data. For large vocabulary systems requiring cross-word context dependent...
The functionality of systems that extract information from texts can be specified quite simply: the input is a stream of texts and the output is some representation of the information to be extracted. Hence, the problem of template design...
This paper deals with theoretical problems found in the work that is being carried out for annotating semantic roles in the Basque Dependency Treebank (BDT). We will present the resources used and the way the annotation is being done....
The study reported in this paper addresses three issues related to phonetic classification: 1) whether it is important to choose an appropriate signal representation, 2) whether there are any advantages in extracting acoustic attributes...
This article presents the methods and workflow for semi-automatic linguistic annotation of Akkadian cuneiform texts and a Neo-Babylonian corpus created with them. The backbone of our workflow is BabyLemmatizer, a neural annotation...
In today’s digital and connected environment, it has become much easier to dissect services like interpretation into very small units. In some cases, interpreters working in micro units, i.e. within a limited space of information, may...
Although word association measures are useful for deciphering the semantic nuances of long extinct languages, they are very sensitive to excessively formulaic narrative patterns and full or partial duplication caused by different copies,...
As claimed in the Semantic Web project, a huge number of physically distributed interacting software agents could find the semantics of available resources and answer more relevantly to users' requests if the content of...
Our team has been working for several years on building adaptive systems using self-organising mechanisms following a specific approach we called the AMAS Theory. Its main originality is that it enables artificial systems to show...
Lexica play an important role in every linguistic discipline. We are confronted with many types of lexica. Depending on the type of lexicon and the language we are currently faced with a large variety of structures from very simple tables...
Though the GYDER system has achieved the highest accuracy scores for the metonymy resolution shared task at SemEval-2007 in all six subtasks, we don't consider the results (72.80% accuracy for org, 84.36% for loc) particularly...
In this paper we describe some preliminary results of qualitative evaluation of the answering system HITIQA (High-Quality Interactive Question Answering) which has been developed over the last 2 years as an advanced research tool for...
The present paper seeks to analyse and describe the noun phrase in Khortha. This paper explores different forms of noun phrases in Khortha. Khortha belongs to the Indo-Aryan language family and is very similar to Hindi. According to...
META-NET is a European network of excellence, founded in 2010, that consists of 60 research centres in 34 European countries. One of the key visions and goals of META-NET is a truly multilingual Europe, which is substantially supported...
This research develops a neural machine translation system for converting ancient Old Assyrian cuneiform business records into modern English, addressing a long standing challenge in digital humanities and historical linguistics. Using a...
I am researching the cultural history of the peoples of Europe with the deep research version of ChatGPT5.2; for each people I analyze the development of its modern language, then the changes in that language's written form, and furthermore the high-culture literary and...
In a globalized world that is increasingly connected, the translation market has grown significantly. The increase in market demand has led to a growing reliance on translation technologies by human translators, who now feel obliged to...
In this paper we outline the use of the multipurpose software tool LeXimir in our approach to automated production of lemmas for e-dictionaries of multi-word units. Development of morphological dictionaries of MWUs is a tedious task,...