Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add multimodal support to Bedrock Converse integration #1887

Open
tzolov opened this issue Dec 7, 2024 · 0 comments
Open

Add multimodal support to Bedrock Converse integration #1887

tzolov opened this issue Dec 7, 2024 · 0 comments
Assignees
Milestone

Comments

@tzolov
Copy link
Contributor

tzolov commented Dec 7, 2024

The Bedrock Converse integration should support multimodal inputs including documents (PDF, DOC, etc.), images (JPEG, PNG, etc.), and videos (MP4, MOV, etc.). This will enable users to interact with the Bedrock AI models using various media types. Update BedrockProxyChatModel to handle multimodal content

Related: AWS Bedrock Multimodal API documentation

@tzolov tzolov added this to the 1.0.0-M5 milestone Dec 7, 2024
@tzolov tzolov self-assigned this Dec 7, 2024
tzolov added a commit to tzolov/spring-ai that referenced this issue Dec 7, 2024
- Add BedrockMediaFormat class to handle media format conversions for documents, images and videos
- Enhance Media class with builder pattern and comprehensive format constants
- Refactor BedrockProxyChatModel to support multimodal content handling
- Add integration tests for PDF, image and video processing
- Add unit tests for Media and BedrockMediaFormat classes
- Upgrade AWS SDK version from 2.26.7 to 2.29.29
- Remove redundant aws.sdk.version property in favor of awssdk.version

Resolves spring-projects#1887
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant