
[FEATURE] Add code snippets to run MeaningBERT locally #1

Closed
DennisDavari opened this issue Jan 16, 2024 · 10 comments
Labels
enhancement New feature or request

Comments

@DennisDavari

DennisDavari commented Jan 16, 2024

Is your feature request related to a problem? Please describe.
How do you actually use the MeaningBERT metric? I wasn't able to reproduce sensible results with this model.

Describe the solution you'd like
Provide a Python code snippet that can be used to run the model locally.

Describe alternatives you've considered
I tried this code to run the model locally:

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Local copy of the MeaningBERT checkpoint
tokenizer = AutoTokenizer.from_pretrained("C:/big-files-m5/MeaningBERT")
model = AutoModelForSequenceClassification.from_pretrained("C:/big-files-m5/MeaningBERT")

# Score a single sentence pair
sentences = ["He wanted to make them pay.", "This sandwich looks delicious."]
inputs = tokenizer(sentences[0], sentences[1], return_tensors="pt", padding=True)

with torch.no_grad():
    outputs = model(**inputs)
    logits = outputs.logits

print(logits)

Even though the code executes successfully, I don't get sensible results. For completely identical sentences, I get a low score, and for completely unrelated sentences, I get a high score.
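
Concretely, the sanity check I have in mind looks like this (same hypothetical local path as above; I am assuming the checkpoint exposes a single regression logit on roughly a 0-100 scale):

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

checkpoint = "C:/big-files-m5/MeaningBERT"  # local copy, as above
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint)
model.eval()

# An identical pair should score high; an unrelated pair should score low.
pairs = [
    ("He wanted to make them pay.", "He wanted to make them pay."),
    ("He wanted to make them pay.", "This sandwich looks delicious."),
]

for source, candidate in pairs:
    inputs = tokenizer(source, candidate, return_tensors="pt")
    with torch.no_grad():
        score = model(**inputs).logits.squeeze().item()  # assumes a single-logit head
    print(f"{source!r} vs. {candidate!r}: {score:.2f}")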

Additional context
I tried to verify whether the results are as they should be by comparing the results from the local model with the results from the remote models. However, when I use the model via the "Compute" button on Hugging Face, I don't get any value as a result, and when I use the model via the Inference API, I always get the value 1. This is the code I used to access the model via the Inference API:

import requests

API_URL = "https://api-inference.huggingface.co/models/davebulaval/MeaningBERT"
# "xxx" stands in for a (redacted) Hugging Face API token
headers = {"Authorization": "Bearer xxx"}


def query(payload):
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()


output = query({
    "inputs": "He wanted to make them pay. This sandwich looks delicious.",
})

print(output)
@DennisDavari added the enhancement label on Jan 16, 2024

Thank you for your interest in improving Deepparse.

@davebulaval
Contributor

I do not have time this week to investigate the problem. I will look at it next week.

I may have pushed the wrong model online. I did not try the model after pushing it to Hugging Face.

@davebulaval
Contributor

davebulaval commented Feb 11, 2024

@DennisDavari, I have investigated the problem.

  1. I have added a code snippet to the README (see the sketch below).
  2. I pushed the wrong model (one trained without data augmentation). I am currently retraining a new one and will make it available as soon as possible; I will let you know. Based on the H.F. progress bar, I should have the model within a day or so and should be able to push it soon after.
  3. The Metrics Card is not working and is currently unavailable. I will fix it with the new model.

P.S. If you have/create a dataset, I would be more than happy to retrain the model and integrate it here.
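
For reference, here is a rough sketch along the lines of that README snippet (not verbatim; it assumes the Hub checkpoint davebulaval/MeaningBERT and a single-logit regression head on roughly a 0-100 scale):

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Assumed Hub checkpoint id; the authoritative snippet is in the README.
checkpoint = "davebulaval/MeaningBERT"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
scorer = AutoModelForSequenceClassification.from_pretrained(checkpoint)
scorer.eval()

documents = ["He wanted to make them pay.", "This sandwich looks delicious."]
simplifications = ["He wanted to make them pay.", "This sandwich looks delicious."]

# Tokenize the document/simplification pairs together so each row is one pair
inputs = tokenizer(documents, simplifications, return_tensors="pt", padding=True, truncation=True)

with torch.no_grad():
    scores = scorer(**inputs).logits.squeeze(-1)

print(scores.tolist())  # identical pairs should land near the top of the scale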

@davebulaval
Contributor

@DennisDavari, I am working on a better fix. I lost part of the data augmentation dataset between the article and the model releases, so the data augmentation is different, and this version's performance seems to be lower. I have created a better data augmentation procedure (I do not know why I did not do this at first) and am currently training the model to validate whether its performance matches the article. I will make sure to keep you posted.

@DennisDavari
Author

Thank you for the update! I am looking forward to the model!

@davebulaval
Contributor

@DennisDavari, I have pushed a better model version, but I sometimes still get strange results. I have improved the data augmentation approach and released a new version of the dataset (and model). I am working on a third version.

@DennisDavari
Author

Thanks a lot for your effort! Since I am finishing my master's thesis soon, I just wanted to ask roughly when you expect to finish the third version. I am asking because I am deciding whether to use the current version for my thesis or whether I can still wait for the third version.

@davebulaval
Contributor

I am training V3 right now. It should take maybe 2-3 days.

@davebulaval
Contributor

I am waiting on the last training run to see if I can get better quantitative metrics. I have created a Metrics Card to simplify the use of MeaningBERT and to quickly fix some errors. See here.

@davebulaval
Contributor

@DennisDavari, I just pushed V3 and released the weights. The quantitative metrics are better, but I still observe some errors. I have fixed some issues with the Metrics Card, so I recommend using the metric module.

Here is a code snippet:

import evaluate

# Source documents and their candidate simplifications, aligned by index
documents = ["He wanted to make them pay.", "This sandwich looks delicious.", "He wants to eat."]
simplifications = ["He wanted to make them pay.", "This sandwich looks delicious.",
                   "Whatever, whenever, this is a sentence."]

# Load the MeaningBERT metric module from the Hugging Face Hub
meaning_bert = evaluate.load("davebulaval/meaningbert")

print(meaning_bert.compute(documents=documents, simplifications=simplifications))
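
With this snippet, the two identical pairs should come back with scores near the top of the scale and the unrelated third pair noticeably lower, which is the sanity check that failed with the earlier weights.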
