(171st place - Top 11%) Repository for the "Google QUEST Q&A Labeling" Kaggle competition.

Google QUEST Q&A Labeling

Improving automated understanding of complex question answer content

Kaggle competition: https://www.kaggle.com/c/google-quest-challenge

What you will find

Published Kaggle kernels:

Overview

Computers are really good at answering questions with single, verifiable answers. But humans are often still better at answering questions about opinions, recommendations, or personal experiences.

Humans are better at addressing subjective questions that require a deeper, multidimensional understanding of context, something computers aren't trained to do well yet. Questions can take many forms: some have multi-sentence elaborations, others may be simple expressions of curiosity or fully developed problems. They can have multiple intents, or seek advice and opinions. Some may be helpful and others interesting. Some are simply right or wrong.

Unfortunately, it’s hard to build better subjective question-answering algorithms because of a lack of data and predictive models. That’s why the CrowdSource team at Google Research, a group dedicated to advancing NLP and other types of ML science via crowdsourcing, has collected data on a number of these quality scoring aspects.

In this competition, you’re challenged to use this new dataset to build predictive algorithms for different subjective aspects of question-answering. The question-answer pairs were gathered from nearly 70 different websites, in a "common-sense" fashion. Our raters received minimal guidance and training, and relied largely on their subjective interpretation of the prompts. As such, each prompt was crafted in the most intuitive fashion so that raters could simply use their common sense to complete the task. By lessening our dependency on complicated and opaque rating guidelines, we hope to increase the reuse value of this data set. What you see is what you get!
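Concretely, the competition frames this as multi-target regression over raw text: each question-answer pair carries a set of continuous quality labels normalised to [0, 1]. The sketch below shows one way to inspect the data, assuming the standard competition files (train.csv and sample_submission.csv) as distributed by Kaggle:

```python
import pandas as pd

# Assumes the standard competition files are in the working directory.
train = pd.read_csv("train.csv")
sample_sub = pd.read_csv("sample_submission.csv")

# The model inputs are the raw text fields of each question-answer pair.
text_cols = ["question_title", "question_body", "answer"]

# The sample submission lists every subjective target column
# (everything except the qa_id identifier); each is a score in [0, 1].
target_cols = [c for c in sample_sub.columns if c != "qa_id"]
print(len(target_cols), "subjective targets, e.g.", target_cols[:3])
```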

Demonstrating that these subjective labels can be predicted reliably can shine new light on this research area. Results from this competition will inform how future intelligent Q&A systems are built, hopefully contributing to making them more human-like.
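Prediction reliability in this competition was scored with the mean column-wise Spearman rank correlation between predicted and ground-truth labels. A minimal sketch of that metric (mean_spearman is a hypothetical helper name, not part of any library):

```python
import numpy as np
from scipy.stats import spearmanr

def mean_spearman(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean of the per-column Spearman rank correlations."""
    per_column = []
    for i in range(y_true.shape[1]):
        # spearmanr returns (correlation, p-value); only the former is scored.
        rho, _ = spearmanr(y_true[:, i], y_pred[:, i])
        per_column.append(rho)
    return float(np.mean(per_column))
```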