Conferences (past & future) (or click here to go to list of CBL-related project sites/centres)
Most of the events here have links to programmes/speakers and abstracts. I try to update this page frequently, but… no guarantees! * Upcoming conferences are indicated by " |
||||||
|
ICAME conferences: Annual conference series for corpus-based research on English (ICAME = International Computer Archive of Modern and Medieval English). For a complete list of ICAME conferences and their proceedings, click HERE. |
||||||
|
|
26-30 May 2010, |
|||||
|
ICAME 2009 (30th conf) |
27-31 May 2009, |
|||||
|
ICAME 2008 (29th conf) |
14-18 May 2008, |
|||||
|
ICAME 2007 (28th conf) |
23-27 May 2007, |
|||||
|
ICAME 2006 (27th conf) |
24-28 May, 2006, - Some pictures from the conference taken by |
|||||
|
ICAME 2005 (26th conf) |
See the AACL Conferences entry below for the North American Conference on Corpus Linguistics |
|||||
|
ICAME 2004 (25th conf) |
19-23 May, 2004, |
|||||
|
ICAME 2003 (24th conf) |
23-27 April, 2003, |
|||||
|
ICAME 2002 (23rd conf) |
22-26 May 2002, - Some pictures from the conference taken by |
|||||
|
ICAME 2001 (22nd conf) |
16-20 May 2001, - Has programme and abstracts of all papers and posters. - Some pictures from the conference taken by Knut Hofland are here, and |
|||||
|
ICAME 2000 (21st conf) |
21-25th April 2000, - Programme here - And pictures taken by Knut Hofland are here, and pictures by |
|||||
|
ICAME 1999 (20th conf) |
26-30th May 1999, - No web site that I can find, but here's a summary of the conference, and here are pictures from the conference taken by Knut Hofland and Sebastian Hoffmann. |
|||||
|
TaLC (Teaching & Language Corpora) Conference series held every 2 years; pedagogical focus; not restricted to English. For a list of TaLC, PALC, and "Corpus Linguistics" conferences and their proceedings, click HERE. |
||||||
|
|
To be held in Brno, Czech Republic. |
|||||
|
4-6 July 2008, |
||||||
|
2-4 July 2006, |
||||||
|
6-9 July 2004, (b&w). |
||||||
|
TaLC 2002 |
27-31 July 2002, |
|||||
|
19-23rd July 2000, |
||||||
|
24-27th July 1998, |
||||||
|
9-12th August, 1996, |
||||||
|
10-13th April, 1994, |
||||||
|
"CL" ("Corpus Linguistics"-branded) Conferences. Held every 2 years. Differs from ICAME in that it's not restricted to English. |
||||||
|
CL2009 (Corpus Linguistics 2009) |
21-23 July 2009, (Pre-conference workshop: 20 July 2009). Deadline for
submission was 23 January 2009. |
|||||
|
CL2007 (Corpus Linguistics 2007) |
27-30 July 2007, |
|||||
|
CL2005 (Corpus Linguistics 2005) |
14-17 July 2005, |
|||||
|
CL2003 (Corpus Linguistics 2003) |
28 March - 1 April 2003, |
|||||
|
CL2001 (Corpus Linguistics 2001) |
30 March - 2 April 2001, - Some pictures from the conference here |
|||||
|
AACL Conferences (American Association for Applied Corpus Linguistics) Started off as the "North American Symposium on Corpus Linguistics and Language Teaching", and was once "AAACL", but I guess there were too many 'A's, so one was dropped. |
||||||
|
8-11 October, 2009, University of Alberta, Edmonton, Alberta, Canada. (Deadline for submission of abstracts was May 1, 2009) |
||||||
|
13-15 March, 2008, |
||||||
|
AAACL 2006 |
20-22 October, 2006, |
|||||
|
12-15 May, 2005, |
||||||
|
21-23 May, 2004, [Proceedings published as: Fitzpatrick, Eileen (Ed.).
(2007). Corpus Linguistics Beyond the Word. Corpus Research from Phrase to
Discourse. |
||||||
|
AAACL 2002 |
1-3 November 2002, |
|||||
|
AAACL 2001 |
23-25 March 2001, Park Plaza Hotel, |
|||||
|
AAACL 2000 |
31 March - 2 April, 2000, Northern |
|||||
|
AAACL 1999 |
20-22 May 1999, |
|||||
|
IVACS (Inter-Varietal Applied Corpus Studies) A newcomer on the scene, also focusing on pedagogical applications of CBL |
||||||
|
|
18-19 June, 2010, * Call for Papers & conference details from here (Deadline
for abstracts: 20 Dec 2009) |
|||||
|
IVACS 4 ("Applying Corpus Linguistics") |
13-14 June, 2008, * Call for Papers & conference details from here |
|||||
|
IVACS 3 ("Language at the Interface") |
23-24 June, 2006, Centre for Research in Applied
Linguistics, * Call for Papers & conference details from here |
|||||
|
IVACS 2 ("Analyzing Discourse in Context") |
25-26 June, 2004, The Graduate School of Education,
Queen’s University, * Call for Papers & conference details from here |
|||||
|
IVACS 1 ("Analyzing Discourse in Context") |
June 2002, |
|||||
|
PALC (Practical Applications in Language Corpora)
Conference series held every 2 years in |
||||||
|
PALC 2003 |
4-6 April 2003, Call for Papers & conference details from the Corpora List archive here Lewandowska-Tomaszczyk, B. (ed.) Practical
Applications in Language and Computers (PALC 2003). Frankfurt am |
|||||
|
PALC 2001 |
7-9 Sept 2001,
Lódz, Poland. |
|||||
|
PALC 1999 |
15-18 April
1999, Lódz, Poland. Lewandowska-Tomaszczyk,
Barbara & Patrick James MELIA (Eds.) (2000). PALC'99--Practical
Applications in Language Corpora: papers from the international
conference at the |
|||||
|
|
12-14 April
1997, Lódz, Poland. Lewandowska-Tomaszczyk,
Barbara & Patrick James MELIA (Eds.) (1997). International
Conference on Practical Applications in Language Corpora (Lódz, Poland, 10-14
April 1997: proceedings). Lódz: |
|||||
Learner Corpora Conferences
|
||||||
|
SUMMER SCHOOL: Learner Corpus Research: Theory and Applications |
13-17 September 2004, |
|||||
|
14-16 December, 1998, The |
||||||
|
4-9 August 1996, part of AILA '96, in |
||||||
|
September 11th-13th, 2003, Albert-Ludwigs University
Freiburg i.Br., |
||||||
Summer Schools, Short Courses, One-offs |
|
|
Seminar
on "New trends in corpus linguistics for language teaching and
translation studies" (In honour of John Sinclair) |
Seminar-cum-conference to be held in |
|
Seminar on
"Corpora in Discourse Analysis and in Language Teaching" |
Seminar to be held at the ESSE 9 conference (European
Society for the Study of English) in |
|
Computing, Theorizing, Communicating |
120th MLA Annual Convention, Session 658, 27-30 December
2004, |
|
One-day Workshop (free): Corpora (Electronic Texts) Applied to Language Teaching (CALT) Workshop |
28 February 2005, |
|
One-day Workshop (free): IT & Linguistics (Tips and Tricks for Teaching Linguistics with Technology) |
17 February 2005, CILT, |
|
Learner Corpus Research: Theory And Applications |
13-17
September 2004 Louvain-la-Neuve, Belgium. Organised by the Centre
for English Corpus Linguistics. |
|
Australasian Language Technology Summer School 2004 (ALTSS 2004) |
4-7 December 2004, In conjunction with the Australasian Language Technology Workshop (ALTWS 2004) and the Australian International Conference on Speech Science & Technology (SST 2004). |
|
Short Course at Birmingham Univ: Using a Corpus in Learning and Teaching English for Academic Purposes |
6th – 8th September, 2004, |
|
Tuscan Word Centre courses: "How to use corpora in language work" |
19th-22nd May, and 26th-29th May 2003 |
Translation Conferences |
|
|
|
27-29th July 2010, |
|
Using Corpora in Contrastive and Translation Studies (UCCTS 2008) |
25-27 September 2008, |
|
1st Athens International Conference on Translation and Interpretation |
13-14 Oct 2006, |
|
16-17 November 2006, Kensington, 29-30 November 2007, Kensington, |
|
|
Building
and Using Parallel Texts: Data Driven Machine Translation and Beyond |
May 31, 2003, HLT-NAACL 2003 Workshop |
|
Corpus Use and Learning to Translate (CULT-BCN 2004) |
|
|
Corpus Use and Learning to Translate (CULT 2K) |
3-4 November 2000, |
|
Corpus Use and Learning to Translate (CULT 1997) |
|
|
August
30 - September 3, 2004, Fortaleza, Ceará, Brazil. One of the thematic sessions will be on Translation and Corpora (papers dealing with any aspects - theoretical, practical, pedagogical - involving the use of corpora in translation) |
|
|
28th May 2002, |
|
|
Translation: From Theory to Practice and from Practice to Theory |
3-5 July 2003,
Université de Bretagne Sud, Lorient, France. |
Other Possibly Relevant Conferences
|
|
|
7 September 2009, |
|
|
The many faces of Phraseology: An interdisciplinary conference |
13-15 October 2005, Louvain-la-Neuve, Belgium. Organised by the UCL Centre for English Corpus Linguistics (CECL) |
|
TELRI (Trans-European Language Resources Infrastructure) 26-28 September 2002, International Centre of Croatian Universities, |
|
|
TELRI Seminar: Multilingual Corpus Research (6th TELRI Seminar) |
9-11 November 2001, |
|
19th-21st November, 2003, |
|
|
3èmes Journées de la Linguistique de Corpus |
11-13
Septembre 2003, Lorient, France |
|
2èmes Journées de la Linguistique de Corpus |
12-14 Septembre 2002, |
|
Journées de la Linguistique de Corpus |
14 Septembre 2001, Lorient, France |
Computational/Technically-oriented Conferences (more for people in LE, NLP, IR, TTS, CL, HLT, etc. -- if you don't know what those acronyms stand for, these conferences are probably not for you!) |
|
|
ACL-2004 Workshop on Multiword Expressions: Integrating Processing |
26th July 2004, |
|
Joint International Conference of the Association for Literary and Linguistic Computing and the Association for Computers and the Humanities. |
June 15-19, 2005, |
|
Joint International Conference of the Association for Literary and Linguistic Computing and the Association for Computers and the Humanities. |
June 11 - 16, 2004, Past Conferences of ALLC/ACH are here. |
|
DRH (Digital Resources for the Humanities) DRH2003 31 August - 3 September 2003 |
a series of annual conferences whose goal is to bring together the creators, users, distributors, and custodians of digital resources in the humanities (NOT focussed on corpora/linguistics -- has a broader remit, concentrating on digital/digitised resources for researchers in history, literature, cultural studies, etc.) |
|
11th EURALEX International Congress |
6-10th July, 2004, |
|
JADT (Journées
internationales d'Analyse statistique des Données Textuelles) |
Louvain-la-Neuve (Belgium), March 10-12, 2004. This biennial conference is open to all scholars working in the vast field of textual data analysis, ranging from lexicography to the analysis of political discourse, from documentary research to marketing research, from computational linguistics to sociolinguistics, from the processing of data to content analysis. |
|
Fifth Slovenian and First International Language Technologies Conference |
October 9 - 10, 2006,
* development, standardisation and use of language resources;
* speech technologies;
* machine translation and other multi- and cross-lingual processing;
* semantic web and knowledge representation related HLT;
* statistical and machine learning of language models;
* information retrieval and extraction, question answering;
* HLT applications;
* presentations of HLT related projects |
|
LREC2002 (3rd International Conference on Language Resources and Evaluation) |
29-31 May 2002, Conference on issues in the design, construction and use of language resources (LR) and the methods for the evaluation of resources, technologies, products and applications. |
|
18th-22nd August 2003, |
|
|
Pre-conference workshop of Corpus Linguistics 2003. |
|
Places and Projects
Use this as an alphabetical list to find web links related to places, projects, and people. |
|
|
AC/DC Project (Brazilian Portuguese & parallel corpora) |
Acesso a corpora/Disponibilização de corpora ("access and availability of corpora"); one of the activities of the resource centre Linguateca |
|
the major professional society for people working in computer-aided research in literature and language studies, history, philosophy, and other humanities disciplines, and especially research involving the manipulation and analysis of textual materials. The ACH is devoted to disseminating information among its members about work in the field of humanities computing, as well as encouraging the development and dissemination of significant textual and linguistic resources and software for scholarly research. Publishes the journal Computers and the Humanities and is linked with the Humanist discussion list. |
|
|
ACL NLP-CL Universe |
a Web catalog/search engine that is devoted to Natural Language Processing and Computational Linguistics Web sites. The Association for Computational Linguistics (ACL) is THE international scientific and professional society for people working on problems involving natural language and computation. Publishes the journal Computational Linguistics |
|
supporting the application of computing in the study of language and literature; remit encompasses not only text analysis and language corpora, but also image processing and electronic editions. Journal: Literary and Linguistic Computing |
|
|
home of the Centre for Corpus Research (and the Centre for English Language Studies (CELS)) |
|
|
Centre for
English Corpus Linguistics, Université catholique de Louvain |
|
|
at King's College |
|
|
Centre for
Corpus Research ( |
part of the |
|
CETH
( |
Center for Electronic Texts in the Humanities |
|
at |
|
|
the website of the Computational Linguistics Group (CLG) at the University of Oxford, UK |
|
|
E-list "Corpus Linguistics and Language Teaching" |
|
|
COBUILD Home Page |
Home of the Bank of English (Sampler here) |
|
The International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (COCOSDA) has been established to encourage and promote international interaction and cooperation in the foundation areas of Spoken Language Processing, especially for Speech Input/Output. |
|
|
click on the link to read back issues, OR try the SIGLEX index of the discussion list (Selected messages to the CORPORA mailing list have been categorized and links to the threads have been provided. The categorization is based on a SIGLEX ontology. The links have been generated automatically based on subject, the date, and the sender. The links include only the years 1997 to the present. Before 2000, the CORPORA archive is not threaded) |
|
|
Corpus Encoding Standard (see also XCES: XML Version of the CES) |
more for the NLP people and larger-corpus builders; a set of encoding standards for corpus-based work and natural language processing applications |
|
now superseded by the Learning and Teaching Support Network (LTSN) |
|
|
Expert Advisory Group on Language Engineering Standards. An initiative of the European Commission which aims to accelerate the provision of standards for: (i) Very large-scale language resources (such as text corpora, computational lexicons and speech corpora); (ii) Means of manipulating such knowledge, via computational linguistic formalisms, mark up languages and various software tools; (iii) Means of assessing and evaluating resources, tools and products. See also the entry for ISLE (International Standards for Language Engineering), which is the world-wide continuation of the EAGLES Project |
|
|
ECHO (European Cultural Heritage Online) or contributor's web site here |
The main goal of ECHO is the establishment of a European infrastructure fostering the transfer of cultural heritage to the Internet under some essential conditions, among them: free access to high-quality documents pertaining to cultural heritage; interoperability between different corpora; co-evolution of corpora, standards, and tools; access to the primary data through an ECHO-portal via scholarly metadata. |
|
ELAN (European Language Resources Activity Network) |
ELAN aspires to link resources developed by the members of
the two Associations (PAROLE and TELRI) with their potential users throughout
a) to reinforce or, where necessary, create international standards by
- conforming a significant part of the data of the members of PAROLE and TELRI to a common format providing standardised resources for the following languages: Belgian French, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovene, Swedish, Turkish and Ukrainian;
- designing a common query language;
b) to operate, on the basis of the common query language, a user community network that will make accessible a large stock of electronic resources, with a clear copyright policy, user support, e-mail user groups, etc. |
|
ELI
(English Language Institute), |
Home of the MICASE (Michigan Corpus of Academic Spoken English) project. |
|
ELDA (European Language Resources Distribution Agency) |
the distribution arm of ELRA: provides the organisational
infrastructure for identifying, classifying, collecting, validating,
marketing, distributing and licensing European language resources (spoken,
written and terminological corpora and related resources); also disseminates
info on HLT (human language technology). |
|
ELRA (European Language Resources
Association) |
non-profit-making organisation founded by the European
Commission in 1995, with the mission of providing a clearing house for
language resources and promoting Human Language Technologies. A focal point for
information related to language resources in |
|
ELSNET is the European Network of Excellence in Human
Language Technologies. Its main objective is to advance human language
technologies in a broad sense by bringing together |
|
|
Electronic
Text Center ( |
Text archive site |
|
European Association for Lexicography |
|
|
Book Companion Site |
|
|
Archives of the discussion list for Language and the Law (sometimes involves corpus work) |
|
|
an online lexical resource for English, based on frame semantics and supported by corpus evidence (the BNC). Aims to document the range of semantic and syntactic combinatory possibilities (valences) of each word in each of its senses, through manual annotation of example sentences and automatic capture and organization of the annotation results. Each FrameNet entry will provide links to other lexical resources, including definitions (from the Concise Oxford Dictionary), WordNet synsets and the COMLEX subcategorization frames. |
|
|
Text archive site |
|
|
Intute: Arts & Humanities Hub |
free online service providing access to the best Web
resources for education and research, selected and evaluated by a network of
subject specialists. There are over 21,000 Web resources listed here that are
freely available by keyword searching and browsing |
|
a comprehensive catalog of language-related Internet resources, hand-reviewed. Includes online language lessons, translating dictionaries, native literature, translation services, software, language schools, or just a little information on a language you've heard about |
|
|
International Computer Archive of Modern and Medieval English. The aim of the organization is to collect and distribute information on English language material available for computer processing and on linguistic research completed or in progress on the material, to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions. |
|
|
(Corpus Technology research group at the Institut für
Deutsche Sprache, |
central non-university institution for the study and
documentation of current usage and recent history of the German language. The
Corpus Technology research group at the IDS hosts the Mannheimer Corpora
project (the world's largest collection of German online corpora, and the COSMAS |
|
INL (Instituut voor Nederlandse Lexicologie) |
The Institute for Dutch Lexicology (based in |
|
an online course in English grammar written primarily for
university undergraduates. Based at UCL, |
|
|
ISCA (International Speech Communication Association) |
main goal of this non-profit organization is "to promote Speech Communication Science and Technology, both in the industrial and Academic areas", covering all the aspects of Speech Communication (Acoustics, Phonetics, Phonology, Linguistics, Natural Language Processing, Artificial Intelligence, Cognitive Science, Signal Processing, Pattern Recognition, etc.). |
|
ISLE (International Standards for Language Engineering) |
the world-wide continuation of the EAGLES Project |
|
LDC - Linguistic Data Consortium |
creating and sharing linguistic resources (mainly
corpora): data, tools and standards; an open consortium of universities,
companies and government research laboratories. It creates, collects and
distributes speech and text databases, lexicons, and other resources for
research and development purposes. The |
|
a 1-million-word corpus which matches as closely as possible the LOB and FLOB corpora of written British English, except that the year of data collection is 1931, or near to that date (+/- 3 years). The immediate purpose of building this corpus is to make it possible to compare these three temporally equidistant corpora (1931, 1961, 1991): "Pre-LOB", LOB, and FLOB. This will enable us to track grammatical change through a period of 60 years of the 20th century. The new corpus under construction is as yet unnamed |
|
|
Searchable web site for the main discussion list for all aspects of linguistics (including CBL/corpus-based linguistics) |
|
|
describes tools and formats for creating and managing linguistic annotations; the page is no longer being actively maintained. |
|
|
Searchable archive of abstracts from 1986 to date. |
|
|
CECL = Centre for English Corpus Linguistics, home of learner corpora projects |
|
|
LTG
(Language Technology Group, |
Tools, software, etc. |
|
a virtual information center on the wide spectrum of
technologies for dealing with human languages; maintained by the |
|
|
MATE (Multilevel Annotation, Tools Engineering Telematics Project) |
The MATE project aims to facilitate re-use of language resources by addressing the problems of creating, acquiring, and maintaining language corpora. The problems are addressed along two lines: through the development of a standard for annotating resources; through the provision of tools which will make the processes of knowledge acquisition and extraction more efficient. Specifically, MATE will treat spoken dialogue corpora at multiple levels, focusing on prosody, (morpho-) syntax, co-reference, dialogue acts, and communicative difficulties, as well as inter-level interaction. |
|
Machine Readable Spoken English Corpus (MARSEC) |
a corpus of mainly prepared speech, time-aligned with sound files, annotated for prosody, phonemes, part-of-speech. |
|
The |
|
|
a searchable/browsable catalogue of natural language processing (NLP) software, with summary notes on features/capabilities, platforms, costs, etc. Includes academic, commercial and proprietary software. The NLSR does not undertake any distribution of the listed software. |
|
|
Open Language Archives Community (OLAC) A metadata initiative for language data and NLP tools |
an international partnership of institutions and individuals; creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources. OLAC works in a simple way: linguistic archives or individual linguists ("Data Providers") describe their resources using the simple OLAC metadata format. This metadata is then electronically "harvested" by "Service Providers" which make the information available to users in the form of a searchable database. Even if the resource itself is not available on the internet (a collection of cassettes, for example), people will still be able to find out what resources exist and where to find them. * Has a search facility covering the resource catalogs of LDC, ELRA and the ACL/DFKI Natural Language Software Registry, and permits single searches to be applied to all catalogs simultaneously. The OLAC cross-archive search engine now harvests 11,000+ records from 12 OLAC archives. |
|
OTA- Oxford Text Archive |
Collections of downloadable texts (mostly literary). |
|
LE-PAROLE (Language Engineering - Preparatory Action for Linguistic Resources Organization for Language Engineering) |
The European PAROLE project aimed to create a large-scale harmonised set of "core" corpora and lexica for over a dozen West European languages in compliance with common guidelines. 14 Western European language groups participated in the PAROLE project. Language corpora and lexica were built according to the same design and composition principles, in the period 1996-1998. For each of these languages, the project has resulted in a 20-million-word text corpus composed according to similar design principles and TEI encoded according to the PAROLE DTD. 250,000 words are POS encoded. Another product of the PAROLE project is a set of harmonised lexica containing a minimum of 20,000 entries provided with morphosyntactic and syntactic information. |
|
PELCRA (Polish and English Language Corpora for Research and Applications) |
Polish and English modern language corpora for research
and other linguistic applications. (co-operation between the |
|
an association of
scholars, educators, publishers, test developers, and education providers
committed to research in Professional English (PE) and the development of
high-quality Professional English resources, products, and services to meet
growing international demands. Professional English (PE) consists of all
spoken and written discourse that is used by working professionals and
professionals-in-training to engage in the work of their profession.
Professional English includes English for science, engineering, technology,
law, medicine, finance, and other professions. |
|
|
NLP at New York University; focus on the application areas of Information Extraction and Machine Translation; long-term goal is to build systems that automatically find the information you're looking for, pick out the most useful bits, and present it in your preferred language, at the right level of detail. One of our main challenges is to endow computers with linguistic knowledge. The kinds of knowledge that we have attempted to encode include vocabularies, morphology, syntax, semantics, grounding, genre variation, and translational equivalence. We work on both deterministic and stochastic knowledge models |
|
|
REAL
Centre (Research in English and Applied Linguistics, at
|
various research projects at |
|
SACODEYL is a web based system for the assisted compilation and open distribution of European teen talk in the context of language education. It includes the collection and distribution of English, French, German, Italian, Lithuanian, Romanian, and Spanish teen talk. It also distributes software tools HERE. You can search the seven corpora HERE, |
|
|
Links on corpora, dictionaries, etc. from the Summer Institute of Linguistics. |
|
|
The goal of the SIMPLE project is to add semantic information, selected for its relevance for LE applications, to the set of harmonised multifunctional lexica built for 12 European languages by the PAROLE consortium. |
|
|
SPARKLE (Shallow PARsing and Knowledge extraction for Language Engineering) |
The first goal of SPARKLE is to produce generic software able to reliably produce a unique, correct but simple phrasal-level syntactic analysis of naturally-occurring free text. This software will be capable of practical use for processing of substantial quantities of such (corpus) material. Such phrasal-parsers will be generic in the sense that they aim to be compatible with a variety of extant approaches to lemmatisation, morphological analysis and lexical syntactic tagging and aim to be straightforwardly parameterisable for different (European) languages. The second goal is to develop a lexical acquisition system capable of learning subcategorisation, argument structure and semantic selection preferences for individual predicates from free text containing instances of such predicates. The lexicon acquisition system will also be developed as a parameterisable multilingual software tool incorporating language-independent and language-dependent linguistic knowledge concerning membership of predicates in broad semantic classes, (diathesis) alternations, and the linking of arguments to thematic relations. |
|
TELRI (Trans-European Language Resources
Infrastructure) |
pan-European alliance of currently 28 focal national language
(technology) institutions with the emphasis on Central and Eastern European
and |
|
TEI Consortium (Text Encoding Initiative) |
an international and interdisciplinary standard that helps libraries, museums, publishers, and individual scholars represent all kinds of literary and linguistic texts for online research and teaching, using an encoding scheme that is maximally expressive and minimally obsolescent |
|
TRACTOR (TELRI Research Archive of Computational Tools & Resources) |
Corpora in TWENTY languages (including Bulgarian, Croatian, Czech, Dutch, English, Estonian, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Romanian, Russian, Serbian, Slovak, Slovenian, Swedish, Turkish, Ukrainian and Uzbek); Parallel corpora in a variety of pairings; Software for processing corpus evidence; Lexicons and other language-information resources. The TRACTOR archive, network and user community are a key
part of the TELRI agenda to build links between the research communities in
Western, Central and |
|
Tuscan Word Centre (TWC) |
A non-profit association devoted to promoting the scientific study of language. It organises one-week high-level courses for language researchers and workers in the language industries. Concentrates on the use of electronic corpora for different purposes, including: translation, automatic or machine-aided language processing, tagging, parsing etc., language teaching support, language learning assistance, lexicography and language reference |
|
UCREL ( |
University Centre For Computer Corpus Research On Language, Lancaster University |
|
VISL project (Visual Interactive Syntax Learning) |
a research and development
project at the Building on a complex web of HTML-pages, CGI-scripts, Java- and Perl-programs, manually annotated text data bases, and Constraint Grammar (CG) tools for automatic analysis, the VISL internet site offers a graphic interface which allows the user, for a wide variety of languages, to analyse corpus examples, textbook material and free running text in an interactive way - choosing between full automatic parsing and guided manual analysis on various levels of complexity. At the core of the analyses, whether manual or automatic, is a clear distinction between form and function on the word, group, and clause levels. Although the automatic grammatical description is based on CG, it can be transformed to different user-specified notational systems, such as tree structures, tagged running text or in-text color codes. |
|
specialises in the area of lexical processes in second language
learning (at the |
|
Did you find this web
site useful? Spotted dead links or want me to add links? Do let me know.
This page was last updated: 08 September 2009 16:35:50
© David Lee