Automated Google Drive Media Intake for Transcription
Automatically syncs new Google Drive files, filters audio/video, and outputs transcription-ready media for downstream workflows.
Created by Chris Moen • Version 1 • 13 steps
Streamline Your Media Processing Pipeline
This workflow automates the extraction and preparation of media files for transcription. By syncing directly with Google Drive, the flow monitors specific folders for new uploads, automatically filters for compatible audio and video formats, and prepares them for downstream processing. It eliminates the manual effort of downloading, sorting, and re-uploading files, ensuring your transcription pipeline stays up to date in real-time.
Seamless Google Drive & AssemblyAI Integration
Leveraging the Google Drive API and robust cloud functions, the template manages complex file batching and metadata synchronization. It uses a persistent Key-Value (KV) store to track sync timestamps, ensuring no file is processed twice. Once identified, media is optimized for AssemblyAI or other transcription engines, building complete job records and metadata summaries automatically.
Key Benefits of Automated Intake
- Time Efficiency: Stop manually moving large video and audio files between platforms.
- Intelligent Filtering: Automatically distinguishes between media and non-media files to save on processing costs.
- Scalable Batching: Handles high volumes of data by planning and batching downloads to prevent API rate limiting.
- Data Integrity: Maintains accurate metadata records and sync history for a fully auditable media workflow.
Tags: google-drive, media-intake, file-sync, transcription-workflow, audio-processing, video-processing, automation
Steps
- Load last sync time (kv)
- List files in Google Drive folder (http)
- Pause before next page (wait)
- Plan Drive file downloads (function)
- Batch Drive file downloads (function)
- Select audio and video files (function)
- Plan media downloads for transcription (function)
- Batch media downloads for transcription (function)
- Build synced media summary (function)
- Build transcription jobs (function)
- Build Drive file metadata records (function)
- Find latest Drive modified time (function)
- Save last sync time (kv)
FAQ
What is the Automated Google Drive Media Intake for Transcription flow?埋
This workflow automates the collection of media from Google Drive by monitoring specific folders for new uploads, filtering for compatible audio and video formats, and preparing them for transcription.
Which services does this media intake automation integrate with?
The flow connects Google Drive and AssemblyAI, using HTTP requests to list files and Key-Value (KV) storage to track sync times and avoid duplicate processing.
How does the flow handle large volumes of video and audio files?
This automation uses a batch processing function to scan your Drive files, filters out non-media formats, and generates transcription-ready metadata and job records for downstream processing.
Can I customize how often the flow checks for new media in Google Drive?
Yes, the workflow utilizes "wait" steps and Key-Value storage to manage pagination and sync timing, ensuring that only the latest modified files are processed in each cycle.