One Agent To Rule Them All: Towards Multi-agent Conversational AI

Repository that accompanies One Agent To Rule Them All: Towards Multi-agent Conversational AI

BBAI (Black-Box Agent Integration) Dataset

The dataset collected for this paper is located in the ./data folder and consists of all question-response pairs used for training the MARS encoder and evaluating the question agents pairing and question response pairing approaches.

MARS Encoder for Multi-agent Response Selection

Our MARS Encoder(Multi-agent Response Selection) model is available for download on HuggingFace here. This model was trained using SentenceTransformers Cross-Encoder class.

Usage and Performance

MARS Encoder can be used like this:

from sentence_transformers import CrossEncoder
model = CrossEncoder('csclarke/MARS-Encoder')
scores = model.predict([('question 1', 'response 1'), ('question 1', 'response 2')])

The model will predict scores for the pairs ('question 1', 'response 1') and ('question 1', 'response 2').

You can use this model also without sentence_transformers and by just using Transformers AutoModel class

Reproduce results

Here is a quick script to reproduce the MARS encoder results on data/test.json.

import json
import numpy as np
from sentence_transformers import CrossEncoder

label_map = ['alexa', 'google', 'houndify', 'recipe', 'dictionary', 'task_manager', 'hotel', 'stock', 'math', 'sport', 'wikipedia', 'mobile', 'banking', 'coffee', 'event_search', 'jokes', 'reminders', 'adasa', 'covid']

# load MARS encoder
model = CrossEncoder('csclarke/MARS-Encoder')

# load test data
test = json.load(open('data/test.json'))

total = 0
count = 0

# This can be made much faster w/batching
for k, v in test.items():
  responses = [(k,v[label]) for label in label_map]
  scores = model.predict(responses) 
  agent = label_map[np.argmax(scores)]
  
  # Skip examples where no valid agent response is presents
  if "none" in v["human"]:
      continue

  total += 1

  if agent in v["human"]:
      count +=1
print("Accuracy: {}".format(count / total))

The baseline models evaluated in this paper can be accessed here:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
.gitignore		.gitignore
README.md		README.md
paper.pdf		paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

One Agent To Rule Them All: Towards Multi-agent Conversational AI

BBAI (Black-Box Agent Integration) Dataset

MARS Encoder for Multi-agent Response Selection

Usage and Performance

Reproduce results

About

Releases

Packages

ChrisIsKing/black-box-multi-agent-integation

Folders and files

Latest commit

History

Repository files navigation

One Agent To Rule Them All: Towards Multi-agent Conversational AI

BBAI (Black-Box Agent Integration) Dataset

MARS Encoder for Multi-agent Response Selection

Usage and Performance

Reproduce results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages