Multilingual support #15

scr255 · 2022-09-14T12:44:34Z

Code for English:

from concept import ConceptModel
concept_model = ConceptModel()
concepts = concept_model.fit_transform(images, docs)
# Works correctly!

The guide suggests "Use Concept(embedding_model="clip-ViT-B-32-multilingual-v1") to select a model that supports 50+ languages.":

from concept import Concept
# ImportError: cannot import name 'Concept' from 'concept' --> I guess you mean to import ConceptModel

Importing ConceptModel:

from concept import ConceptModel
concept_model = ConceptModel(embedding_model="clip-ViT-B-32-multilingual-v1")
concepts = concept_model.fit_transform(images, docs)
# TypeError: 'JpegImageFile' object is not subscriptable

MaartenGr · 2022-09-20T07:15:56Z

Hmmm, there might be something going wrong with the images that you pass to the model. Did the code work for you with the English version?

scr255 · 2022-09-20T08:33:17Z

> Hmmm, there might be something going wrong with the images that you pass to the model. Did the code work for you with the English version?

Yes, the English model "clip-ViT-B-32" is working fine, while "clip-ViT-B-32-multilingual-v1" throws the error.

I've tried changing the dataset (all images in .jpeg format), and the same problem happens.

MaartenGr · 2022-09-21T15:30:05Z

Unfortunately, then there seems to be an issue with that specific model processing the images. You could try to embed the images using SentenceTransformers directly and then pass the embeddings to fit_transform using the image_embeddings parameter. That way, you can also check whether a specific image in your dataset is causing the problem.
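
A rough sketch of that workaround, not a confirmed fix. It embeds the images one at a time so that a failing image can be pinpointed, then hands the precomputed embeddings to fit_transform. The helper names embed_images_individually and run_example are made up for illustration. Using "clip-ViT-B-32" for the image side is an assumption based on my understanding that the multilingual checkpoint only encodes text into the same vector space. The docs keyword is likewise assumed from the calls earlier in this thread.

```python
def embed_images_individually(images, encode):
    """Embed images one at a time and record which ones fail.

    `encode` is any callable that maps a list of images to a sequence of
    vectors, for example the `encode` method of a SentenceTransformer model.
    """
    embeddings = []  # vectors of the images that embedded cleanly
    kept = []        # indices of those images
    failed = []      # (index, error message) for images that raised
    for index, image in enumerate(images):
        try:
            vector = encode([image])[0]
        except Exception as error:
            failed.append((index, str(error)))
            continue
        embeddings.append(vector)
        kept.append(index)
    return embeddings, kept, failed


def run_example(image_paths, docs):
    """Hypothetical end-to-end usage; requires the real libraries and models."""
    import numpy as np
    from PIL import Image
    from sentence_transformers import SentenceTransformer
    from concept import ConceptModel

    # Assumption: images are embedded with the image-capable CLIP model.
    image_encoder = SentenceTransformer("clip-ViT-B-32")
    images = [Image.open(path) for path in image_paths]

    embeddings, kept, failed = embed_images_individually(images, image_encoder.encode)
    for index, message in failed:
        print(f"Could not embed {image_paths[index]}: {message}")

    # Keep only the images that embedded, so paths and embeddings stay aligned.
    kept_paths = [image_paths[index] for index in kept]
    concept_model = ConceptModel(embedding_model="clip-ViT-B-32-multilingual-v1")
    return concept_model.fit_transform(
        kept_paths, docs=docs, image_embeddings=np.vstack(embeddings)
    )
```

If every image embeds without error here but fit_transform still fails with the multilingual model, the problem is more likely in how that model is asked to handle images than in the dataset itself.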
