Description
QuikVox AI is a powerful WordPress assistant designed to streamline your podcast production workflow. By leveraging the advanced capabilities of Google Gemini AI, it allows you to generate high-quality scripts from your existing content and convert them into natural-sounding audio—all without leaving your WordPress dashboard.
Whether you are an AI news blogger or a content creator looking to expand into audio, QuikVox AI provides the tools to automate the tedious parts of scriptwriting and voice generation.
Key Features
- AI-Powered Script Generation: Automatically extract content from your posts/pages and generate professional podcast scripts using Google Gemini (Flash/Pro/Flash-Lite models).
- Natural AI Voices: Convert scripts into audio using the latest Gemini 2.5 TTS (Generative Audio) models. Choose from 15+ high-quality voices with distinct characteristics.
- Multilingual Support: Generate content in 5 major languages: Japanese, English, Chinese (Simplified/Traditional), and Korean.
- Prompt Management: Save and manage custom prompt sets for different podcast styles. Includes an AI translation tool to help you expand your prompts globally.
- Seamless Media Integration: Generated audio files are automatically saved to your WordPress Media Library and can be embedded directly into your articles via a simple audio player.
- Post & Page Support: Works with both standard Posts and Pages, allowing you to turn any content into audio.
Advanced Voice Selection
Unlike basic TTS plugins, QuikVox AI provides detailed metadata for each voice:
* Gender Identification: Clear male/female labels.
* Tone Characteristics: Voices are tagged with their unique style (e.g., “Warm, deep, informative” or “Soft, narrating”).
* Dynamic UI: The editor dropdown shows icons and descriptions so you can find the perfect voice for your persona.
External Services
This plugin connects to an external service:
- Google Gemini API
– Purpose: Generate podcast scripts from post content and convert text to speech
– Data sent: Post content and optional user prompts
– Service provider: Google
– Privacy Policy: https://policies.google.com/privacy
– Terms of Service: https://policies.google.com/terms
Users must provide their own API key to use this feature. Script and voice generation data is sent only when the user triggers generation actions.
- QuikVox AI License Verification Service
– Endpoint: https://quikvox-ai.com/license/verify
– Purpose: Verify license keys and refresh plan/status information
– Data sent: License key, site URL, home URL, and plugin version
– Service provider: QuikVox AI
– Triggered only when:
– an administrator saves or activates a license key
– an administrator clicks the “License Recheck” button
– the QuikVox AI Settings page is opened and the scheduled next check time has passed
No license verification request is sent from normal front-end page views, post views, or general admin screens.
Installation
- Upload the
quikvox-aifolder to the/wp-content/plugins/directory. - Activate the plugin through the ‘Plugins’ menu in WordPress.
- Navigate to Settings > QuikVox AI to enter your Google Gemini API Key and configure the main plugin settings.
- Use the QuikVox AI top-level menu for Talk Scripts and Audio Analytics.
- Optionally, configure Your Voice Generation API Key to enable TTS.
- Start creating by clicking “Create Podcast Script” in the sidebar of any Post or Page editor.
- If you need to issue a test license, run
scripts/issue-license.ps1from PowerShell and provideLICENSE_SIGNING_SECRETat runtime. You can also pass-IssuedDateto set the issue date explicitly. The script generates a Business or Pro license key without storing the secret in the repository.
FAQ
-
Where do I get a Gemini API Key?
-
You can obtain an API key from the Google AI Studio.
-
Are the audio files hosted locally?
-
Yes, generated audio files (WAV format) are saved directly into your
wp-content/uploadsdirectory and registered in your Media Library for full ownership. -
Does it support multi-speaker podcasts?
-
The current version supports single-speaker script generation. Multi-speaker support (dialogue) is available as a Pro feature.
Reviews
There are no reviews for this plugin.
Contributors & Developers
“QuikVox AI” is open source software. The following people have contributed to this plugin.
ContributorsTranslate “QuikVox AI” into your language.
Interested in development?
Browse the code, check out the SVN repository, or subscribe to the development log by RSS.
Changelog
1.0.8
- Reworked the admin menu structure so QuikVox AI appears as a top-level menu with Talk Scripts and Audio Analytics beneath it.
- Kept the main settings page under the WordPress Settings menu and aligned the admin UI with WordPress conventions.
- Improved the admin settings screen by removing inline JavaScript from core controls and tightening settings sanitization.
1.0.7
- Revised Smart Tone admin UI to match Prompt Sets behavior more closely.
- Restored default-star indicators in the Smart Tone list and removed the separate default summary cards.
- Simplified Smart Tone row actions so built-in styles use View/Copy and custom styles use Edit/Delete/Copy as appropriate.
- Added read-only Smart Tone view mode and blocked direct edit/delete operations for built-in styles.
- Documented the current built-in single-speaker Prompt Sets in docs/prompt-sets-single-ja.md.
1.0.6
- Reworked the workflow sidebar and modal layout to clarify selection, generation, and embedding steps.
- Added and reorganized planning docs for roadmap, task tracking, and release context.
- Removed generated pycache artifacts from the repository and ignored future Python cache files.
1.0.5
- Refined the script generator sidebar UI for single and multi-speaker workflows.
- Simplified generation progress popups for script and voice creation.
- Added docs for mockup organization and audio chunk loudness tracking.
1.0.4
- Security: Masked License Key input field with eye icon toggle to prevent credential exposure.
- Security: Masked Service Account JSON (Vertex AI) with blur filter and eye icon toggle.
- Security: Masked Google AI Studio Script Generation API Key with eye icon toggle.
- Security: Masked Google AI Studio Voice Generation API Key with eye icon toggle.
- Fix: Default Gemini model fallback updated from deprecated
gemini-protogemini-2.5-proto resolve 404 errors on script generation. - Fix: Sidebar model fallback list updated to current Gemini 2.5 series.
- Fix: Plugin Check — NonceVerification warnings resolved for redirect notification flags.
- Fix: Plugin Check — Added
wp_unslash()and sanitization to Smart Tone config and auth JSON inputs. - Fix: Plugin Check — Wrapped
error_log()inWP_DEBUGguard (Vertex AI error handler). - Fix: Plugin Check — Replaced
esc_url()withesc_url_raw()for input sanitization in audio URL handler. - Fix: Plugin Check —
stable_tag_mismatchresolved. - Fix: Plugin Check —
plugin_header_nonexistent_domain_pathresolved by creatinglanguages/directory. - Chore: Added
.distignoreto exclude development files from distribution packages. - UI: Updated official website link to https://quikvox-ai.com/.
- UI: Removed redundant “Uses Global Endpoint” label from Vertex AI model selector (behavior is automatic).
1.0.2
- Security: Removed internal API response body from client-facing error messages (Vertex AI TTS and Gemini API).
- Security: Added model ID format validation (regex) for TTS model parameter in voice generation handler.
- Security: Applied input sanitization to Smart Tone text handler for consistency.
- Security: Fixed IDOR vulnerability by adding post read permission check before script generation.
- Security: Replaced unsafe HTML rendering pattern in React component with regex-based tag stripping.
- Security: Corrected URL escaping function to use HTML-context-appropriate method in audio insert handler.
- Security: Suppressed internal URL and model details from client-facing Vertex AI error messages.
- Security: Added Service Account JSON format validation on settings save.
- Security: Removed project_id disclosure from Vertex AI connection test AJAX response.
- UI: Switched connection test status display to textContent to prevent potential HTML injection.
- Removed debug loading log from production script bundle.
1.0.1
- Fixed AI Translate issue in Prompt Sets by improving Gemini 2.5/Thinking model support.
- Implemented comprehensive API response parsing to handle thinking blocks and Markdown code fences.
- Extended API timeout to 60 seconds.
- Removed
response_mime_type: 'application/json'to avoid conflicts with newer Gemini models.
1.0.0
- Official stable release.
- Updated plugin versioning to 1.0.0.
- Verified plan-based features and UI consistency.
0.8.1
- (Previous entries)
- Implemented Vertex AI integration for both script and voice generation.
- Added support for Cloud Text-to-Speech via Vertex AI (MP3 output).
- Implemented text chunking for TTS to handle Gemini TTS byte limits (512 bytes) and timeout issues.
- Improved error handling with a selectable/copyable error modal in the editor.
- Added dynamic download labels (MP3 vs WAV) in the sidebar.
- Fixed endpoint routing for Vertex AI preview models (locations/global).
0.7.8
- Renamed plugin to QuikVox AI (slug: quikvox-ai).
- Migrated all inline scripts/styles to wp_enqueue (admin.js / admin.css).
- Fixed i18n: text domain unified to quikvox-ai and missing 2nd args added.
- Added ABSPATH guards to all PHP files.
0.7.7
- Code consistency improvements for ww.wp.xz.cn standards.
0.7.6
- ww.wp.xz.cn submission preparation.
- Removed ElevenLabs API integration (Gemini TTS only).
- Added External Services disclosure section.
- Security improvements: Enhanced nonce verification and data sanitization.
- Code cleanup for ww.wp.xz.cn compliance.
0.7.5
- Official preparation for ww.wp.xz.cn directory submission.
- Updated Gemini 2.5 TTS (GA) model support.
- Added support for Gemini 1.5 Flash-Lite.
- Enhanced Voice metadata (Name, Gender, Characteristics) in UI.
- Improved buttons: Scripts can now be embedded directly under the player.
- General UI/UX polishing for the admin settings page.
0.5.0
- Initial beta release with basic script generation and TTS support.