Month: September 2017

LTE: Scraping web-sites to collect data

In this post, I detail how I collected the data for the letters to the editor corpus analysis project. First, I picked several web-sites where I could access letters to the editor archives: Chicago Tribune from Illinois The Citizen from Georgia Daily Herald from Illinois Dubois County Free Press from Indiana Ellsworth American from Maine …

LTE: Scraping web-sites to collect data Read More »

LTE: Letters to the editor corpus analysis using machine learning

See if you can determine whether the author of the following texts is male or female. Text A: When my children were younger, my goal was to get them into the gifted program or even charter schools, because I just wanted a high-quality option. I was thankful that one of my daughters was accepted to …

LTE: Letters to the editor corpus analysis using machine learning Read More »