Exploring Natural Language: Working with the British Component of the International Corpus of English

Front Cover
John Benjamins Publishing, 2002 - Language Arts & Disciplines - 342 pages
0 Reviews
ICE-GB is a 1 million-word corpus of contemporary British English. It is fully parsed, and contains over 83,000 syntactic trees. Together with the dedicated retrieval software, ICECUP, ICE-GB is an unprecedented resource for the study of English syntax.Exploring Natural Language is a comprehensive guide to both corpus and software. It contains a full reference for ICE-GB. The chapters on ICECUP provide complete instructions on the use of the many features of the software, including concordancing, lexical and grammatical searches, sociolinguistic queries, random sampling, and searching for syntactic structures using ICECUP's Fuzzy Tree Fragment models. Special attention is given to the principles of experimental design in a parsed corpus.Six case studies provide step-by-step illustrations of how the corpus and software can be used to explore real linguistic issues, from simple lexical studies to more complex syntactic topics, such as noun phrase structure, verb transitivity, and voice.
  

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

INTRODUCING ICEGB
2
THE ICEGB GRAMMAR
22
INTRODUCING THE ICE CORPUS UTILITY PROGRAM ICECUP
70
BROWSING THE CORPUS
85
FUZZY TREE FRAGMENTS AND TEXT QUERIES
117
COMBINING QUERIES
177
ADVANCED FACILITIES IN ICECUP 3 1
203
CASE STUDIES USING ICEGB
233
BROADCAST INTERVIEWS
313
LEGAL CROSSEXAMINATIONS
314
UNSCRIPTED SPEECHES
315
DEMONSTRATIONS
316
NEWS BROADCASTS
317
NONBROADCAST SPEECHES SCRIPTED
318
UNTIMED STUDENT ESSAYS
319
BUSINESS LETTERS
321

PRINCIPLES OF EXPERIMENTAL DESIGN WITH A PARSED CORPUS
257
FUTURE PROSPECTS
285
KNOWLEDGE DISCOVERY IN CORPORA
295
AIDING THE ANNOTATION OF CORPORA
297
TEACHING GRAMMAR WITH CORPORA
299
REFERENCES
301
ICE TEXT CATEGORIES AND CODES
307
2 WRITTEN CATEGORIES
308
SOURCES OF ICEGB TEXTS
309
DIRECT CONVERSATIONS
310
TELEPHONE CALLS
312
ACADEMIC WRITING
324
POPULAR WRITING
326
NEWSPAPER REPORTS
327
ADMINISTRATIVEREGULATORY WRITING
329
PRESS EDITORIALS
330
BIBLIOGRAPHICAL AND BIOGRAPHICAL VARIABLES
332
STRUCTURAL MARKUP SYMBOLS
333
A QUICK REFERENCE GUIDE TO THE ICE GRAMMAR
334
SPECIAL CHARACTERS USED IN ICEGB
337
INDEX
338
Copyright

Common terms and phrases

References to this book

All Book Search results »

About the author (2002)

Sean Wallis is a Senior Research Fellow in the Survey of English Usage at University College London.

Bas Aarts is Professor of English Linguistics and Director of the Survey of English Usage at University College London.

Bibliographic information