This project allows you to generate image captions using the LLaVA model running on your local machine. It's 100% free and open source, designed to help you quickly create captions for images with the ability to specify a tone of voice.
- Free and open source.
- Generate captions based on images using the LLaVA model.
- Specify the tone of the captions (e.g., professional, casual, humorous, etc.).
- Simple and intuitive command-line usage.
To generate captions for your images, use the following command in your terminal:
caption [IMAGE_FOLDER] [DESTINATION_PATH] [TONE]
caption ./my-image ./my-output professional
[IMAGE_FOLDER]
: The folder containing the images you want to generate captions for.[DESTINATION_PATH]
: The folder where the generated captions will be saved.[TONE]
: The tone of the caption, such asprofessional
,casual
, orhumorous
.
- Clone the repository:
git clone [email protected]:fiqryq/caption-llava.git
- Navigate to the project directory:
cd caption-llava
- Install the required dependencies:
npm install # or yarn install # or pnpm install
- Run the application:
npm start # or yarn start # or pnpm install
- Node.js installed on your machine.
- The LLaVA model configured locally.
- Make sure the LLaVA model is properly set up and running on your machine before using the caption generator.
- The tone parameter is optional. If you don't specify a tone, it defaults to a general caption style.
Contributions are welcome! Feel free to open an issue or submit a pull request to improve the project.
This project is licensed under the MIT License. See the LICENSE file for more details.