ITRI-03-06
Marina Santini
Identifying Genres on the Web
The main aim of the proposed research project is to develop computational methods that help identify the genres used on the Web, including novel ones, and the proportions they occur in a large sample of English Web documents. The suggested approach aims at providing much of the qualitative detail that is common to genre analysis supported by the reliability that is assured by quantitative corpus analysis and statistical techniques.