Programming for Corpus Linguistics with Python and Dataframes (Elements in Corpus Linguistics)
- Synopsis
- This Element offers intermediate and experienced programmers algorithms for corpus linguistics (CL) programming in Python using dataframes, which provide a fast, efficient, and intuitive set of methods for working with large, complex datasets such as corpora. It demonstrates principles of dataframe programming applied to CL analyses and presents complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is also presented, including methods for tokenizing, part-of-speech tagging, and lemmatizing with spaCy. Together these provide a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
- Copyright:
- 2024
Book Details
- Book Quality:
- Publisher Quality
- ISBN-13:
- 9781108916387
- Related ISBNs:
- 9781009486781
- Publisher:
- Cambridge University Press
- Date of Addition:
- 09/15/24
- Copyrighted By:
- Daniel Keller
- Adult content:
- No
- Language:
- English
- Has Image Descriptions:
- No
- Categories:
- Nonfiction, Language Arts
- Submitted By:
- Bookshare Staff
- Usage Restrictions:
- This is a copyrighted book.