|
|
|||
|
|
The World Wide Web Access to Corpora Project (W3-Corpora) was run at the Department of Language and Linguistics at the University of Essex. The two year project was funded by the Joint Information Systems Committee (JISC) as part of the W3C-IGE project. BackgroundThe widespread availability of computing infrastructure creates the possibility for linguists and others to consult large collections of texts on-line. Such linguistic corpora represent a valuable but under exploited resource for teaching and research. Uptake has been restricted because of the needs to master a relatively complicated set of techniques. Many linguists do not yet exploit corpus resources in their research or teaching. Moreover, the same is true (but to a much greater extent) of students. It is not generally the case that students and researchers have tried to use corpus resources and found the result unhelpful. Rather, they have not in general tried to use them at all. The reason for this is not inherent conservatism, but the difficulties which face the would-be user of corpus resources, who must make a significant, and otherwise unnecessary, investment in hardware and media, and invest a considerable amount of time and effort learning about corpus searching tools and techniques (which they will typically not otherwise find useful). Description of the projectThe idea of this project was to enable and promote the use of corpus resources by allowing simple and straight forward access, via the WWW, to linguistic corpora. The user only needs access to the WWW to be able to perform corpus searches using a web browsing interface (such as Netscape, Internet Explorer, etc.)
|