Secrets AI Video Generator: How It Works, Quality, and Cost
Video generation from AI companion images is Secrets AI's most technically distinctive capability. Among the mainstream AI companion platforms (Character.AI, CrushOn AI, Janitor AI, Candy AI), none offers this at the same accessibility level. This guide covers the full picture: how the generation process works, what quality to expect, what it costs in Moments (the platform's in-app credits), and when it is and is not worth using.
For context on how this feature fits within the broader platform, the complete review covers everything Secrets AI offers.
What the Video Generator Actually Is
Secrets AI's video generator takes a static AI companion image and animates it into a short video clip based on a text prompt. The underlying technology is a deep learning video generation model, similar in concept to Stable Diffusion-based video extensions, though Secrets AI's implementation is a closed, managed service rather than an open-source tool.
The output is a short motion clip showing your specific companion character performing the described action. The character's appearance, face, and general visual style carry over from the source image — continuity between image and video is one of the stronger aspects of the implementation.
This feature is available on Lite tier and above. Free accounts cannot access video generation.
The competitive context matters: this is genuinely rare in the AI companion category. Character.AI has no video generation. CrushOn AI has none. Janitor AI has none. Candy AI offers limited video. Among platforms specifically targeting the AI girlfriend and companion use case, Secrets AI occupies a rare position with this feature, which is one of the core reasons to use it over alternatives.
How Video Generation Works: Step by Step
The process involves four distinct steps:
- Select or generate a source image. You need an existing companion image to animate. This can be any image in your character gallery — including the four auto-generated images created when you first build a character, or any subsequently generated images.
- Enter a text prompt. Describe the motion, action, or scenario you want the clip to show. Prompt specificity matters: "walking toward the camera" produces different results than "waves and smiles" — both are valid, but more precise prompts produce more predictable outputs.
- Wait for rendering. A clip typically takes about two minutes to render. Generation time can vary slightly with platform load.
- View and save the clip. The completed video appears in your chat or gallery. From there it can be viewed, replayed, and saved.
The system is context-aware: it references the character's scenario and personality settings, meaning a character configured as playful and flirtatious will animate differently than one configured as dominant or serious, even from the same base image.
Video Quality: Honest Assessment
Independent reviewers rate the video quality 4.1/5. That rating reflects a realistic picture: generally strong, with specific limitations.
What works well:
- Character facial expressions are natural and consistent with the character design
- Basic movement animations — walking, turning, light gestures — are smooth and convincing
- The character's visual appearance maintains fidelity to the source image
- Short clips (3-second format) are consistently well-rendered
Where quality varies:
- Complex prompt instructions that involve multiple simultaneous actions can produce artifacts
- Very specific body positioning prompts are less reliably executed than general movement prompts
- Some variation exists between the standard generation model and the Premium/Advanced model — the higher-tier model produces noticeably better results for complex requests
The practical recommendation: use the Premium generation model for videos you care about, and test with shorter prompts before committing Moments to longer complex ones.
What Videos Cost in Moments
Moments cost varies significantly by clip length and quality tier:
| Video Type | Moments Cost | Notes |
|---|---|---|
| Short clip (3 sec) | ~50 Moments | Available from Lite |
| Standard clip | ~300 Moments | Moderate length |
| Full-length clip | ~600 Moments | Longer output |
The Moments impact by subscription tier:
| Plan | Monthly Moments | Max Short Clips | Max Full Clips |
|---|---|---|---|
| Lite | 1,000 | ~20 | N/A (short clips only) |
| Plus | 3,000 | ~60 | ~5 |
| Premium | 8,000 (+10%) | ~160 | ~13 |
| Ultimate | 15,000 (+15%) | ~300 | ~25 |
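The ceilings in this table follow from integer division of each monthly allowance by the per-clip cost. A minimal Python sketch, assuming the approximate costs quoted in this guide (about 50 Moments for a short clip, about 600 for a full-length clip) and treating the listed allowance as the whole monthly budget. The names `PLANS` and `max_clips` are illustrative and not part of any Secrets AI API:

```python
# Approximate per-clip costs quoted in this guide (not official pricing).
SHORT_CLIP_COST = 50
FULL_CLIP_COST = 600

# Monthly Moments per plan as listed in the table; bonus percentages are ignored.
PLANS = {"Lite": 1_000, "Plus": 3_000, "Premium": 8_000, "Ultimate": 15_000}


def max_clips(monthly_moments: int, cost_per_clip: int) -> int:
    """Return how many whole clips a budget covers if spent on video only."""
    return monthly_moments // cost_per_clip


if __name__ == "__main__":
    for plan, moments in PLANS.items():
        short = max_clips(moments, SHORT_CLIP_COST)
        full = max_clips(moments, FULL_CLIP_COST)
        print(f"{plan}: {short} short clips or {full} full-length clips")
```

These are budget ceilings only. Whether a given tier can actually render full-length clips is a separate question: the FAQ in this guide states that Lite is limited to 3-second clips.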
The critical insight: Full-length video is the most Moments-intensive feature on the platform. At 600 Moments per clip, a Plus user's entire 3,000 monthly Moments covers only five videos with nothing left for images, voice, or text. Users who intend to generate regular video content should plan for Premium or Ultimate tier — or supplement with Moments top-up bundles ($5.99 for 1,980 Moments).
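To make the trade-off concrete, the sketch below computes what remains of a monthly allowance after a given number of full-length clips, and the effective dollar price of one clip when it is paid for with top-up Moments. It assumes the figures quoted above (about 600 Moments per full clip, $5.99 for 1,980 Moments); the function names are hypothetical.

```python
FULL_CLIP_COST = 600      # approximate Moments per full-length clip (from this guide)
TOPUP_PRICE_USD = 5.99    # top-up bundle price quoted in this guide
TOPUP_MOMENTS = 1_980     # Moments included in that bundle


def moments_left(monthly_moments: int, full_clips: int) -> int:
    """Moments remaining for images, voice, and text after generating full clips."""
    remaining = monthly_moments - full_clips * FULL_CLIP_COST
    if remaining < 0:
        raise ValueError("clip count exceeds the monthly allowance")
    return remaining


def topup_cost_per_full_clip() -> float:
    """Effective USD price of one full clip paid for entirely with top-up Moments."""
    return TOPUP_PRICE_USD / TOPUP_MOMENTS * FULL_CLIP_COST


if __name__ == "__main__":
    print(moments_left(3_000, 5))                # Plus: 0 left after five clips
    print(moments_left(8_000, 5))                # Premium: 5000 left
    print(round(topup_cost_per_full_clip(), 2))  # about 1.82
```

Under these assumptions, a full-length clip bought through top-ups works out to a little under two dollars.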
For a complete breakdown of Moments costs across all features, the pricing guide has the full comparison.
Video vs Images vs Voice — Honest Cost Comparison
The same 600 Moments that buys one full-length video clip also buys:
| Feature | What 600 Moments Gets You |
|---|---|
| Full video | 1 clip |
| Images | 12–24 images |
| Voice calls | 6 minutes |
| Text messages | 300–600 messages |
This trade-off is the defining decision point for Moments allocation. Video delivers a qualitatively different experience than any number of images — motion, expression, and the sense of watching your companion rather than viewing a static image are genuinely distinct. But for users who generate a lot of images, the Moments cost per unit is considerably lower.
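The equivalences in the table can be reproduced from per-unit costs. The sketch below uses the ranges quoted in this guide's FAQ (25 to 50 Moments per image, 100 Moments per voice minute) and infers 1 to 2 Moments per text message from the table itself; that text-message cost is an assumption, not a published price.

```python
# (low, high) Moments per unit. Image and voice figures come from this guide;
# the text-message range is inferred from the table and is an assumption.
UNIT_COSTS = {
    "full video clip": (600, 600),
    "image": (25, 50),
    "voice minute": (100, 100),
    "text message": (1, 2),
}


def units_for(budget: int, low_cost: int, high_cost: int) -> tuple[int, int]:
    """Return (fewest, most) whole units a budget buys across the cost range."""
    return budget // high_cost, budget // low_cost


if __name__ == "__main__":
    for feature, (low, high) in UNIT_COSTS.items():
        fewest, most = units_for(600, low, high)
        label = str(fewest) if fewest == most else f"{fewest} to {most}"
        print(f"600 Moments buys {label} x {feature}")
```

Changing the first argument of `units_for` gives the same comparison for any other budget, such as the Moments left over after a planned number of clips.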
Tips for Better Video Outputs
These are observations from practical testing, not theoretical advice:
- Use high-quality source images. Video generation inherits the visual fidelity of the source. Blurry or low-resolution source images produce lower-quality video. The Premium generation model for images produces better video source material.
- Start with short clips. Test a prompt with a 3-second clip (~50 Moments) before committing to a full 600-Moment video. If the short version looks right, generate the longer version.
- Keep prompts focused. One action described clearly outperforms a complex multi-part instruction. "Laughs and looks toward camera" works better than "laughs, then looks to the left, then tilts her head and smiles."
- Use the Premium generation model. Available on Premium and Ultimate tiers, it handles complex prompts and detailed action requests noticeably better than the standard model.
- Generate images first. Build a gallery of high-quality still images, then select the best ones as video sources. This approach conserves Moments compared to generating source images specifically for video conversion.
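The "start with short clips" tip can be quantified. Suppose it takes several attempts to land on a prompt that works, and suppose a good 3-second result predicts a good full-length one (an assumption, not something the platform guarantees). The sketch compares iterating directly at full length against testing short first, using the approximate costs from this guide; the function names are illustrative.

```python
SHORT_CLIP_COST = 50    # approximate Moments per 3-second clip
FULL_CLIP_COST = 600    # approximate Moments per full-length clip


def cost_direct(attempts: int) -> int:
    """Every prompt attempt is rendered at full length."""
    return attempts * FULL_CLIP_COST


def cost_test_first(attempts: int) -> int:
    """Every attempt is rendered short; only the final prompt is rendered full."""
    return attempts * SHORT_CLIP_COST + FULL_CLIP_COST


if __name__ == "__main__":
    for attempts in (1, 2, 3, 5):
        print(attempts, cost_direct(attempts), cost_test_first(attempts))
```

With a single attempt, testing first costs 50 Moments more (650 vs 600). From two attempts onward it is cheaper (700 vs 1,200), and the gap widens with every extra attempt.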
Who Should Use the Video Generator?
Worth it for:
- Users who value motion content alongside chat interaction — seeing the companion move, react, and express is a genuinely different experience from static images
- Users who create companion content they want to save and revisit
- Users on Premium or Ultimate tiers where 8,000–15,000 Moments make video a sustainable part of a mixed-use month
Not worth the Moments if:
- Your primary interest is conversation quality — text interaction quality is independent of video generation
- You are on the Plus tier and regularly use images and voice — video will exhaust your monthly budget quickly
- You are testing the platform and not yet committed to paid use — the free tier does not include video generation, and Lite's 3-second clips are a limited preview
Best tier for serious video use: Ultimate ($39.99/mo) for heavy creators, Premium ($19.99/mo) for moderate video alongside other features.
See the free vs premium breakdown for the full tier-by-tier capability comparison.
How Competitors Compare on Video
The competitive landscape is straightforward:
| Platform | Video Generation | Notes |
|---|---|---|
| Secrets AI | Yes | 50–600 Moments per clip, ~2 min generation |
| Character.AI | No | Strictly text + static images |
| CrushOn AI | No | No video capability |
| Janitor AI | No | Text-focused, no media |
| Candy AI | Limited | Limited video relative to Secrets AI |
| SweetDream AI | Limited | Comparable niche feature |
| Xotic AI | Yes (4K, 15 sec) | Premium niche option |
Secrets AI's video generation distinguishes it from every major mainstream competitor. The underlying technology, in the class of Stable Diffusion-based video generation, is rarely packaged in a subscription interface as accessible as the one Secrets AI provides.
FAQ
How long are Secrets AI videos?
Video length depends on the tier and on how many Moments you spend. Short clips at ~50 Moments are approximately 3 seconds long. Full-length clips at ~600 Moments are meaningfully longer, and the quality difference between the minimum-cost and maximum-cost outputs is also significant. The platform does not publicly specify precise maximum lengths.
Can free accounts generate videos?
No. Video generation requires at least the Lite plan ($5.99/mo). Even on the Lite tier, videos are limited to 3-second short clips. Full-length video generation is available on Plus and above. Free accounts receive 200 starting Moments, which can only be applied to image generation (25–50 Moments each) or voice (100 Moments/minute), not video.
How many videos can I generate per month?
It depends on your tier and clip length. On Plus (3,000 Moments): approximately 5 full-length videos or up to 60 short clips. On Premium (8,000 Moments with bonus): approximately 13 full-length videos or around 160 short clips. On Ultimate (15,000 Moments with bonus): approximately 25 full-length videos or around 300 short clips. These calculations assume all Moments go to video; in mixed-use months, video allocation will be lower.
How realistic are the videos?
Rated 4.1/5 by independent reviewers, the videos are generally realistic for AI-generated content. Character appearance carries over faithfully from the source image, and movement animations are smooth for basic actions. Complex multi-step prompts can produce artifacts, and quality varies visibly with the generation model used (standard vs Premium). The clips are short enough that sustained realism is achievable: they are not feature-length productions, which partly explains why they hold up well.