Skip to content

Synthetic Instagram Post Generation for Social Media Research

License

Notifications You must be signed in to change notification settings

thalesbertaglia/instasynth

Repository files navigation

InstaSynth

Code for the paper InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with chatGPT for Sponsored Content Detection, published at ICWSM 2024.

You can read the paper here here.

Reference

@inproceedings{bertaglia2024instasynth,
  title={InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection},
  author={Bertaglia, Thales and Heisig, Lily and Kaushal, Rishabh and Iamnitchi, Adriana},
  booktitle={Proceedings of the International AAAI Conference on Web and Social Media},
  volume={18},
  pages={139--151},
  year={2024}
}

Setting up the Repository

  1. Clone the Repository:

    git clone https://github.com/thalesbertaglia/instasynth.git
    cd instasynth
  2. Install Dependencies using Poetry:

    poetry install

    This command will read the pyproject.toml file from the current project, resolve the dependencies and install them.

  3. Activate the Poetry Environment:

    poetry shell

    This will spawn a shell within the virtual environment.

  4. Add your OpenAI API key to the .env file

    OPENAI_API_KEY="YOUR_OPENAI_API_KEY"

If you're using Jupyter:

  1. Start Jupyter Notebook:

    poetry run jupyter notebook

    This will launch the Jupyter Notebook, and you can navigate to the desired .ipynb file to use it.

If you're using another notebook tool, you can usually start it within the activated Poetry shell.