fix: Keep all the Curse Words #23

HouseholdVTuber · 2022-10-18T16:03:36Z

Hi there,
I uncensored the words in ..schrute\Python\office_transcript.csv, and then ran ..schrute\data-raw\get_data.R to save the updated contents to ..schrute\data\theoffice.rda.

For the most part, sex/bastard/asshole/etc. were censored with asterisks, but sex organs (penis/vagina) were censored with l33tspeak like pen1s and vg1n. I replaced those too.

…nd exported changed content to theoffice.rda. Both words censored with asterisks and censored with l33tspeak were restored.

HouseholdVTuber · 2022-10-21T21:03:12Z

I made this change assuming that the transcript of The Office will, well, never change, so there would probably be no need to run get_transcript.py. But that script is part of the repo and could theoretically be run again, so I'm going to go ahead and modify it to replace the known curse words (s*x, etc.) after they're scraped from the URLs and before they're written to the .csv.
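
A rough sketch of what that replacement step could look like, assuming it runs on each scraped line of dialogue before the row is written out. The mapping below only contains the censored forms mentioned in this thread, and the names `CENSORED_TO_ORIGINAL` and `uncensor` are hypothetical; nothing here reflects how get_transcript.py is actually structured.

```python
import re

# Hypothetical mapping, built only from the examples named in this thread.
# The full list would have to be collected from the scraped transcript.
CENSORED_TO_ORIGINAL = {
    "s*x": "sex",
    "pen1s": "penis",
    "vg1n": "vagina",
}

# One alternation over the escaped keys, so "*" is matched literally
# instead of being treated as a regex quantifier.
_PATTERN = re.compile(
    "|".join(
        re.escape(key)
        for key in sorted(CENSORED_TO_ORIGINAL, key=len, reverse=True)
    ),
    flags=re.IGNORECASE,
)


def _restore(match):
    """Return the uncensored word, keeping a leading capital if present."""
    censored = match.group(0)
    original = CENSORED_TO_ORIGINAL[censored.lower()]
    return original.capitalize() if censored[0].isupper() else original


def uncensor(text):
    """Replace every known censored form in text with the original word."""
    return _PATTERN.sub(_restore, text)
```

Doing the replacement in the scraper rather than by find-replace in the .csv would mean a fresh scrape produces the same uncensored output without any manual step.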

bradlindblad · 2023-03-15T23:10:20Z

@HouseholdVTuber let me know when you make those changes. I agree that find-replace in the CSV probably isn't best practice.
Thanks!

Manually uncensored the curse words in python\office_transcript.csv a…

ccdb88b

…nd exported changed content to theoffice.rda. Both words censored with asterisks and censored with l33tspeak were restored.
