Policies
Prices
Your available credits are shown at the top of each AI tool. When you install Flowmata AI addon for the first time, you’ll receive promo credits. These are used first, before any paid credits.
Right now, Flowmata AI is in its promotional period. When you install, you’ll get 5 free credits and access to AI tools at special prices. In the coming months, promo credits and pricing will be updated
Images Generation
All image generation models have these common parameters:
Save prompt in file – will save the prompt in the “description” field of the file. You can check this description in the Details view of the file in Google Drive. On by default.
Save prompt as filename – will save the prompt as the file name (only first 250 symbols).
FLUX.1 [schnell]
FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions.
Prompt – A text description of the image you want to generate.
Maximum length: 2000 symbols.
Steps (max 8)
The number of diffusion steps; higher values can improve quality but take longer.
Price: free during promotional period.
Stable diffusion xl base 1.0
stable-diffusion-xl-base-1.0 is a diffusion-based text-to-image generative model by Stability AI. Generates images based on text prompts.
Prompt – A text description of the image you want to generate.
Maximum length: 5000 symbols.
Negative prompt (not required) – Text describing elements to avoid in the generated image.
Maximum length: 2000 symbols.
Number of steps (max 20) – The number of diffusion steps; higher values can improve quality but take longer to process.
Guidance (max 20) – Controls how closely the generated image should adhere to the prompt; higher values make the image more aligned with the prompt.
Price: 0.01 credit for one image during promotional period.
Lucid-origin
Lucid Origin from Leonardo.AI is their most adaptable and prompt-responsive model to date. Whether you’re generating images with sharp graphic design,
stunning full-HD renders, or highly specific creative direction, it adheres closely to your prompts, renders text with accuracy, and supports a wide array of visual styles and aesthetics – from stylized concept art to crisp product mockups.
Prompt (required) – A text description of the image you want to generate.
Maximum length: 5000 symbols.
Guidance (max 10) – Controls how closely the generated image should adhere to the prompt; higher values make the image more aligned with the prompt.
Height (max 2048) – The height of the generated image in pixels.
Width (max 2048) – The width of the generated image in pixels.
Steps (max 40) – The number of diffusion steps; higher values can improve quality but take longer to process.
Price: 0.01 credit for one image during promotional period.
Phoenix 1.0
Phoenix 1.0 is a model by Leonardo.Ai that generates images with exceptional prompt adherence and coherent text.
Prompt (required) – A text description of the image you want to generate.
Maximum length: 5000 symbols.
Negative prompt (not required) – Text describing elements to avoid in the generated image.
Maximum length: 2000 symbols.
Guidance (max 10) – Controls how closely the generated image should adhere to the prompt; higher values make the image more aligned with the prompt.
Height (max 2048) – The height of the generated image in pixels.
Width (max 2048) – The width of the generated image in pixels.
Steps (max 40) – The number of diffusion steps; higher values can improve quality but take longer to process.
Price: 0.01 credit for one image during promotional period.
Text to speech generation
MeloTTS
MeloTTS is a high-quality text-to-speech library by MyShell.ai
Text input (required) – The text content to be converted to speech.
Maximum length: 4000 symbols.
Price: free during promotional period.
Deepgram Aura-1
Deepgram Aura-1 is a context-aware text-to-speech (TTS) model that applies natural pacing, expressiveness, and fillers based on the context of the provided text. The quality of your text input directly impacts the naturalness of the audio output.
Text input (required) – The text content to be converted to speech.
Maximum length: 2000 symbols.
Speaker – Speaker used to produce the audio
angus – masculine English-Irish Warm, Friendly, Natural
asteria – feminine English-American Clear, Confident, Knowledgeable
arcas – masculine English-American Natural, Smooth, Clear, Comfortable
orion – masculine English-American Approachable, Comfortable, Calm, Polite
orpheus – masculine English-American Professional, Clear, Confident
athena – feminine English-British Calm, Smooth, Professional
luna – feminine English-American Friendly, Natural, Engaging
zeus – masculine English-American Deep, Trustworthy, Smooth
perseus – masculine English-American Confident, Professional, Clear
helios – masculine English-British Professional, Clear, Confident
hera – feminine English-American Smooth, Warm, Professional
stella – feminine English-American Clear, Professional, Engaging
Price: 0.01 credit for one audio during promotional period.
Text AI
Ask AI
The cost of a question/answer is based on the number of tokens used. A token typically equals 2–4 characters, depending on the model. Input tokens (price_in) and output tokens (price_out) are priced differently. Pricing is shown per token.
GPT OSS 120b
OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-120b is for production, general purpose, high reasoning use-cases.
price_in: 0.00000035, price_out: 0.00000075
GPT OSS 20b
OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases – gpt-oss-20b is for lower latency, and local or specialized use-cases.
price_in: 0.0000002, price_out: 0.0000003
Llama 4 scout 17b
Meta’s Llama 4 Scout is a 17 billion parameter model with 16 experts that is natively multimodal. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
price_in: 0.00000027, price_out: 0.00000085
Llama 3.3 70b
Llama 3.3 70B quantized to fp8 precision, optimized to be faster.
price_in: 0.00000029, price_out: 0.00000225
Llama 3.1 8b
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models. The Llama 3.1 instruction tuned text only models are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.
free during promotional period
Mistral small 3.1 24b
Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities. With 24 billion parameters, this model achieves top-tier capabilities.
price_in: 0.00000035, price_out: 0.00000056
Deepseek r1 distill qwen 32b
DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.
price_in: 0.0000005, price_out: 0.00000488
QwQ 32b
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
price_in: 0.00000066, price_out: 0.000001
Gemma-3-12b
Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Gemma 3 models are multimodal, handling text and image input and generating text output, with a multilingual support in over 140 languages.
price_in: 0.00000035, price_out: 0.00000056