## read the file in using read_delim from readr package
chegg2 = read_delim("data/Bentley University-export10-17-2016_16628777.txt", delim="\t")
## hopefully there are patterns where the bad characters exist
## for example
chegg2$first_name = str_replace_all(chegg2$first_name, "'", "")
## what it comes down to are the options on the various functions
## the readr package is pretty handy, and gets the data in without issues
## from there, you can clean the data, but like I said, hopefully there are some standard columns where the bad data exist
## if there aren't , I can always try to write a loop to find them if you need me to