Only this pageAll pages
Powered by GitBook
1 of 31

Colossyan API

Loading...

Getting Started

Loading...

Loading...

Basics

Loading...

Loading...

Loading...

Loading...

Loading...

Video Generation

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Avatar creation

Loading...

Advanced

Loading...

Loading...

Loading...

Loading...

Loading...

Experimental

Loading...

Welcome

Colossyan's API offers a programmable video creation service. It allows you to generate videos at scale. To facilitate this, we provide a REST API using standard JSON as request and response payload format. This way you can integrate our service into any technology stack.

Features

  • 💬 Over 600 voices

  • 🌐 Over 70 languages

  • 👄 Use any actor of your choice

  • 👤 Create your own avatars through the API, and use them at your will

  • ⚖️ Generate template videos at a massive scale, easily

  • 📹 Everything that you can generate in the Colossyan Studio you can also generate via our API

    • 🖼️ Embedded images

    • 🎬 Embedded videos

    • 🎶 Embedded audios

    • 🔤 Embedded texts

  • 📞 Get notified immediately via a callback mechanism

  • 💅 Thanks to our web-based templating and payload exporting options it's incredibly easy to get up-n-running.

Jump right in

Getting Started

Create your first video

Basics

Learn the basics of our API

Extracting request bodies from the web-editor

Constructing request bodies for complex video generation jobs can become very complicated very quickly. To get ahead of this we provide a way to use our web-based video-generation tool (Colossyan Studio) to export the payload for the video you see.

  1. To do this, open any draft you'd like to generate via the API.

  2. Click on the "Generate" button on the top right corner of your editor, than in the dialog appeared you should see a button called "Export to API", click that button.

    1. Note that you can only see this button if you have API access.

  3. In the appearing modal, you should see the request body that you can send to generate the exact video that you can see in the editor.

  4. Feel free to customise this payload.

Endpoints

Stable features

https://app.colossyan.com/api/v1

Experimental features

https://app.colossyan.com/api

Assets

Using the API, you can list the following assets:

  • List avatars

  • List voices

In each case, you'll get the assets that the workspace you have generated the token it has access to.

To create a new "instant" avatar please see the following page:

  • Create avatar

Video Generation

There are multiple ways to generate a video.

  • You can generate a video based on a template (that you create). In this case you only send us a reference to the template and a list of dynamic variables that you'd like us to replace in the template. See more here: Generating using a template

  • You can generate a video, by sending a whole video-generation-job descriptor JSON to our API. See more here: Generating a video manually

Video generation through the API does not support interactive videos only regular ones.

Once a video generation has been triggered, you have multiple options to get notified once it's ready. See more here: Receiving a generated video

Generated videos

Throught the API it is possible to retrieve or delete a video that was already generated.

  • Retrieve a video

  • Delete a video

Video generation job

A video generation job holds data respective the video generation.

For example by retrieving the generation job you can tell which status the video generation is currenlty in. (e.g. generating, finished, failed)

  • Retrieve video generation job

  • Delete video generation job

Advanced use-cases

In general, we always recommend to generate videos either by:

  • Extracting the video-generation-job payload from our web-based UI editor. More details here: Extracting request bodies from the web-editor.

  • Or by Generating using a template

If for some reason either of these options wouldn't be enough to achieve what you are after, in the following pages you can read about some advanced concepts, when working with our API.

Using template variables

Dynamic template variables are placeholders within your video script or scene text element that can be replaced with different values each time a video is generated.

They allow you to customize content dynamically without modifying the base template.


Learn more on the following pages on how to use them:

  • Script template variables

  • Text template variables

Quickstart

Follow the next steps to generate your first video using Colossyan's API.

  1. Grab your token as described in the Authentication page.

  2. Send your first request

const token = "<paste-your-token-here>";
const api = "https://app.colossyan.com/api/v1";

const job = {
  videoCreative: {
    settings: {
      name: "My first video",
      videoSize: {
        width: 1920,
        height: 1080
      }
    },
    scenes: [
      {
        tracks: [
          {
            type: "actor",
            position: { x: 420, y: 0 },
            size: { width: 1080, height: 1080 },
            actor: "karen",
            text: "This is my first video generated with Colossyan! Amazing!",
            speakerId: "aquXcfLbkxpW4BBI5qKm"
          }
        ]
      }
    ]
  }
};

const response = await fetch(`${api}/video-generation-jobs`, {
  method: "POST",
  headers: {
    Authorization: `Bearer ${token}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify(job),
});

const result = await response.json();
console.log(result); // { id, videoId }

In the snippet above, we post a new video generation job. Let's walk through what you should expect the generated video to look like.

  • The videos name will be My first video.

  • It will have a width of 1920px and a height of 1080px.

  1. If you successfully sent the request, you should have received two things in return.

    1. An id that you can use to delete or request status updates on the job

    2. A videoId that you'll be able to query the generated video with once it's ready.

  2. Navigate to the workspace where you have created your API key. You should see the video generate. Once it's ready, it should look exactly like this

  1. Congratulations! 🎉 You just generated your first video using our API!

Authentication

You need to have a Business or Enterprise plan to be able to use our API.

Colossyan's API is using Bearer Authentication. This means, that every request has to be authenticated by sending the token in the Authorization header pre-pended by the text Bearer . See the example below:

const response = await fetch(`${api}/video-generation-jobs`, {
  method: "POST",
  headers: {
    Authorization: `Bearer ${token}`, // Authentication
    "Content-Type": "application/json",
  },
  body: JSON.stringify(job),
});

You can create or find your existing tokens at the bottom of the Workspace details tab in the Settings page.

Each token belongs to a specific workspace. This is where are the generated videos that you create through the API will be listed. To keep things tidy, it is recommended to create a specific workspace for each use-case for the API.

List avatars

The endpoint will return the available avatars for your workspace.

The types of avatars the endpoint will return:

  • Studio: Avatars provided by Colossyan.

  • Scenario: Also provided by Colossyan, but shown with a specific scenario background.

  • Instant: Custom-made avatars. To learn how to create one using the API, check out the following page ⇒

  • ssa_lite/ssa_studio: These are also custom-made avatars. To learn more, visit .

There are two different inference models supported by the API (returned by the supportedInferenceModel parameter):

  • legacy

  • neo

There are three different avatar variants supported by the API (returned by the supportedActorTrackVariant parameter):

  • full_body

  • bubble

  • shoulder

Avatar variant
Legacy supported
Neo supported

List voices

Voices

Generating using a template

To generate a video based on a template:

  1. Head over to Colossyan and create your desired video draft using our web-based studio

  2. Once you are done, click on the "Generate" button on the top right corner of your editor, than in the dialog appeared click "Export to API"

Note that this button is only visible if you have a plan which has access to the API. For more information, head over to our

  1. On the top right corner of the appearing dialog click "Save job as a template" and make note of the appearing ID. This is the ID of your template job, you can send this id with your request to start generate the video.

  1. Now you have everything to send us a video generation request using the API endpoint below.

Generating a video manually

You can generate a video by sending us a video-generation-job descriptor JSON. This is essentially a recipe for a video. Constructing such a recipe manually for complex videos can become tedious very quickly. Due to this we strongly recommend to either:

  • Extract the video-generation-job payload from our web-based UI editor. More details here: Extracting request bodies from the web-editor

  • Or to generate based on a template. More details here: Generating using a template


Important concepts

To better understand the API, it's recommended to get an understanding of the following building blocks first:

  • Scenes

    • A scene is a logical consolidation of related content.

    • A video can consist of multiple scenes. These scenes are concatenated together. Think of them as slides on a presentation. Each scene is meant to aggregate and encapsulate related content.

Here you can see the details of the endpoint below.

Retrieve a video

Delete a video

Retrieve video generation job

Delete video generation job

Deleting a video generation job will stop the video generation processing.

Create avatar

You can create an "instant" avatar by sending an image or video link to the API. The avatar will be generated based on the provided media.

The API will return the avatar's name, which can then be used for generating video with the avatar.

Script template variables

You can create dynamic template variables in the script on any scene, by simply typing a variable name between curly braces at any point of the script.

For example in the following script...

,,,

Dear {name}, I have seen that you haven't logged into your account for the past two months. Should we catch up quickly? Since then we released several cool stuff, such as {cool_stuff_1} and {cool_stuff_2}. Let me know if you have a free slot on your calendar next week!

,,,

...you have created three dynamic variables:

  1. name

  2. cool_stuff_1

  3. cool_stuff_2

You can replace these template variables with their values, by sending their values in the request body when generating a video. The API will replace the placeholders in the script with the corresponding values you provide.

For example:

On how to generate a video you can find more information here:

Text template variables

You can use dynamic template variables in any text element, by simply typing a variable name between curly braces.

Take the following example:

,,,

Hey {name}, we’ve been improving!

Our newest feature, {feature_name}, is live!

,,,

In this text we created two template variables: name and feature_name .

You can replace these template variables with their values, by sending their values in the request body when generating a video. The API will replace the placeholders in the text with the corresponding values you provide.

For example:

On how to generate a video you can find more information here:

Knowledge to draft

Experimental endpoints are not yet finalised. They can change at any point, without notice. To reflect this, these endpoints are not available on the /v1/ endpoint (See more here: Endpoints)

Check out our pricing page for more info.

Providing values for the variables

Generating a video

Video Generation

Providing values for the variables

Generating a video

Video Generation
{
    "dynamicVariables": {
        "name": "John",
        "cool_stuff_1": "AI-powered task automation",
        "cool_stuff_1": "Slack & Google Calendar integration",
    },
    // other request parameters...
}
{
    "dynamicVariables": {
        "name": "John",
        "feature_name": "AI-powered task automation",
    },
    // other request parameters...
}
Scenes offer a lot of convenience in timing their content, however it is completely up to the individual to decide how to divide the content into scenes.
  • Tracks

    • Tracks are the building blocks of scenes. You define the content of the video in tracks.

    • There are different types of tracks, for different purposes. To differentiate between them we use the type property.

  • For advanced use-cases and customisations (such as advanced timings or animations), please reach out to us, so we can help out with a solution engineer to make sure you achieve what you are after. Read more here: Advanced use-cases.

    shoulder

    ✅

    ✅

    full_body

    ✅

    ❌

    bubble

    ✅

    Important: Scenario and Instant avatars are not included by default. To access them, please contact support.

    Avatar inference model and variant support

    Important: Not all inference models support all avatar variants.

    Create avatar
    this link

    ✅

    It will consist of a single scene

  • We added an actor track to it, with the actor being karen.

    • We configured the actor to say This is my first video generated with Colossyan! Amazing! using the voice of aquXcfLbkxpW4BBI5qKm.

    • The actor is set to be 1080px * 1080px in size and it's top-left will be in the (420;0) coordinates, counting from the top-left, resulting in the actor being in the middle of the video.

  • Generating the payloads for complex videos can get very complicated, very quickly (multiple scenes, and images and videos etc...). To get ahead of this, we provide a way to export the request body from the Colossyan video editor.

    This way you can use our UI to craft your video templates easily, and then customize them at the last mile.

    Keep in mind: the template job ID refers to the state of the draft when you saved it. If later you edit this draft, you'll need to create another template id to represent the changes.

    pricing page.

    Receiving a generated video

    Upon posting a new video generation job, you have the opportunity to add a callback and some callbackPayload to the job. When the job is successful our service will issue a POST request to the url you provided in the callback field.

    In the body of this post message, we add the following fields:

    Key
    Description

    url

    Contains the public url of the generated video file.

    videoUrl and shareUrl

    Contains a unique url pointing at Colossyans video sharing platform.

    status

    The status of the video generation. Can either be finished or failed

    ...and everything that was provided at the job's generation as callbackPayload and dynamicVariables.

    Due to the asynchronous nature of video-generations, first you need to query the video-generation-job itself, to see the status of it. To do this, use the API below.

    Continue to poll the status of the job, until it returns either finished or failed. In case it successfully finished generating use the API below to get the generated video.

    • You can get the videoId both when queueing the job itself, or when fetching it's status.

    An example of a script to poll a job can be found below:

    1. Open Colossyan

    2. Navigate to workspace in which the API key was used to generate the video

    3. Open the generated videos page.

      1. The video should be listed there

      2. You should also see it if it's currently being generated. In this case, it'll show the status of the job.

    There are three ways to get a video, that was generated via our APIs.

    1. You can get notified via our callback once a video is ready (recommended in production)

    2. You can poll it using other API endpoints

    3. You can download the video from your Colossyan Account after navigating to the workspace where the API key was created.

    Callback

    const token = "<your-token-here>";
    const jobId = "<your-job-id-here>";
    const videoId = "<your-video-id-here>";
    const jobUrl = `https://app.colossyan.com/api/v1/video-generation-jobs/${jobId}`;
    const videoUrl = `https://app.colossyan.com/api/v1/generated-videos/${videoId}`;
    
    async function pollStatus() {
      const jobResponse = await fetch(jobUrl, {
        method: "GET",
        headers: {
          Authorization: `Bearer ${token}`,
          "Content-Type": "application/json",
        },
      });
      const result = await jobResponse.json();
      const status = result.status;
    
      if (status !== "finished" && status !== "failed") {
        console.log(
          `Video status: ${status}, progress: ${result.progress}/${result.maximumProgress}`
        );
        setTimeout(pollStatus, 5000); // Retry every 5 seconds
      } else {
        const videoResponse = await fetch(videoUrl, {
          method: "GET",
          headers: {
            Authorization: `Bearer ${token}`,
            "Content-Type": "application/json",
          },
        });
    
        const result = await videoResponse.json();
        console.log(result);
      }
    }
    
    pollStatus();

    Polling

    Getting the video from the Colossyan App

    Timing

    Timing

    Each track and scene has to have a defined or inferable duration. We provide multiple ways of defining these timing constraints. It is up to you to decide which way you would like to use.

    Scene duration

    You can set the duration of each scene in three different ways. In either case no matter how long the duration of the tracks within the scene are, everything longer than the scene itself will be cut out.

    Explicit scene duration

    You can explicitly define the scene's duration in milliseconds by setting the duration property on it.

    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "Scene with explicit duration",
            duration: 2000, // ms
            tracks: [
              {
                type: "actor",
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                text: "This scene is exactly two seconds long. It does not matter how long any of the tracks are.",
                speakerId: "en-GB-RyanNeural",
                actor: "ryan",
              },
            ],
          },
        ],
      },
    };

    Referring to a track

    If you would like your scene to be exactly as long as one of the tracks in it, you can do so by referring to the track on the scene. In this case Colossyan will try to figure out the length of the track in isolation first, and if it can do so, it'll assign the same duration to the scene itself.

    In this case the track duration can only be inferred if it has an default or an explicitly defined duration.

    This is very handy when you would like to make sure that the scene will not end before an actor finishes speaking, or before the end of a video.

    Use the intrinsicDurationTrackReference property in the scene to refer to the referenceId of the track you would like the scene to have an equal duration with.

    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "This scene lasts until the actor stops speaking",
            intrinsicDurationTrackReference: "speech",
            tracks: [
              {
                type: "actor",
                referenceId: "speech",
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                text: "It does not matter how long I am talking. The scene will last as long as necessary.",
                speakerId: "en-GB-RyanNeural",
                actor: "ryan",
              },
            ],
          },
        ],
      },
    };

    Combining this technique with timing the tracks by referring to the scene's start and end time is a very powerful combination, that allows you to achieve intricate videos with the flexibility of handing varying track lengths.

    If you do not explicitly set the duration, nor refer to a track within the scene, Colossyan will try to infer the best guess for the duration of the scene by looking at the tracks, and trying to find a single one with either an intrinsic or explicitly set duration.

    This is only possible if there is only one track in the scene with a default or explicitly set duration. Otherwise Colossyan will throw a schema validation error, when queueing the job.

    Track duration can be set in three different ways. No matter which way you chose, if the scene is shorter, the track will be cut to fit it.

    If however the track you provided does not last long enough to reach the duration defined by you (for example a video ends earlier, or the actor finishes speaking), the track will disappear earlier than you defined.

    Similarly to setting a scene's explicit duration, you can set the duration of a track by setting the duration property on it. The duration should be set in milliseconds.

    You can set the track to start after a certain amount of time has passed since the beginning of the scene, and to last until a certain amount of time before the end of the scene. By setting both of these variables, you are essentially defining the duration of the track as well.

    You can define these gaps from the start and end of the scene by setting the startTimeGap and the endTimeGap properties on the track.

    If you do not define the duration of the track in any way, Colossyan will try to fall back to the intrinsic duration of the track if there is any.

    Only video and actor tracks have intrinsic durations.

    • In case of a video track the intrinsic duration equals the duration of the video itself.

    • In case of an actor track the intrinsic duration equals the duration of the lipsynched video of the actor speaking.

    To time when the track should start or end within the scene, you can use the startTimeGap or the endTimeGap properties on the track.

    • Use startTimeGap and endTimeGap to make videos that work well for dynamic content.

    • Refer to the track's duration in scenes to deal with dynamic track lengths.

    • Do not over-constrain the timing of your scenes and tracks. This will yield in a schema validation error.

    Default scene duration

    Track duration

    Explicit track duration

    Referring to the scene's start and end time

    You can only time the track this way if the scene itself does not rely on the track's duration.

    Default track duration

    Timing tracks within a scene

    If you find yourself in a situation where these options are not suitable for your use-case, double check if there is a way to subdivide your content to different scenes, to achieve the result you are looking for.

    Good practices

    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "This scene defaults to last until the end of the video",
            tracks: [
              {
                type: "video",
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                videoUrl: "https://your.cdn.com/public/videos/some_video.mp4",
              },
            ],
          },
        ],
      },
    };
    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "Scene with a single track with an explicitly set duration",
            tracks: [
              {
                type: "image",
                duration: 3000, // in ms
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                imageUrl: "https://your.cdn.com/public/videos/image.jpg",
              },
            ],
          },
        ],
      },
    };
    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "Scene with a single track with a duration set by startTimeGap and endTimeGap",
            duration: 5000,
            tracks: [
              {
                type: "image",
                startTimeGap: 1000, // ms
                endTimeGap: 2000, // ms
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                imageUrl: "https://your.cdn.com/public/videos/image.jpg",
              },
            ],
          },
        ],
      },
    };
    const job = {
      videoCreative: {
        settings: {
          name: "Simple video with a single scene",
          videoSize: { height: 1080, width: 1920 },
        },
        scenes: [
          {
            name: "Scene with a single track with an intrinsic duration",
            tracks: [
              {
                type: "video",
                position: { x: 200, y: 200 }, // From the top left in pixels
                size: { height: 680, width: 1520 }, // In pixels
                imageUrl: "https://your.cdn.com/public/videos/some_video.mp4",
              },
            ],
          },
        ],
      },
    };

    Retrieve a list of voices

    get

    Fetches a list of voices that the user has access to.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Responses
    200

    A list of voices

    application/json
    samplestringRequired

    The sample audio URL for the voice.

    gendersstring[]Required

    The genders the voice is recommended to be used with.

    charactersstring[]Required
    namestringRequired

    The internal name of the voice.

    agesstring[]Optional
    pitchbooleanOptional
    emphasizebooleanOptional
    pausebooleanOptional
    speedbooleanOptional
    stylestringRequired

    The style of the voice.

    scenariosstring[]Required

    The scenarios in which the voice is recommended to be used.

    displayNamestringRequired

    The display name of the voice.

    languagesstring[]Required

    The languages supported by the voice.

    isClonedVoicebooleanOptional

    Whether the voice is cloned or not.

    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    get/assets/voices
    post

    Generates and creates a draft using structured textual data.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Body
    post

    Submits a new video generation job using a template

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Body
    templateJobIdstringRequired

    The ID of the template job.

    dynamicVariablesobjectRequired

    Dynamic variables to be loaded into the template.

    callbackUrlstring · nullableOptional

    URL for the callback after job completion.

    callbackPayloadobject · nullableOptional

    Callback payload to return with the generated video once it's ready.

    post

    Submits a new video generation job for processing.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Body
    GET /api/v1/assets/voices HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    [
      {
        "sample": "text",
        "genders": [
          "text"
        ],
        "characters": [
          "text"
        ],
        "name": "text",
        "ages": [
          "text"
        ],
        "features": {
          "pitch": true,
          "emphasize": true,
          "pause": true,
          "speed": true
        },
        "style": "text",
        "scenarios": [
          "text"
        ],
        "displayName": "text",
        "languages": [
          "text"
        ],
        "isClonedVoice": true
      }
    ]
    titlestringRequired

    Main title of the content. It's going to be the title of the draft.

    descriptionstringRequired

    Description of the content. Helps to understand the content better.

    titlestringRequired

    Title of the scene.

    scriptstringRequired

    The exact script of the scene.

    searchTermsstringRequired

    Search terms that our system is going to use to find relevant stock images and videos.

    templateIdstring · nullableOptional

    The ID of the template to use. If not provided, the system will randomly select one.

    Responses
    200

    Draft generated and created successfully

    application/json
    urlstringOptional

    URL of the created draft.

    400

    Bad Request. Invalid input data.

    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    post/knowledge-to-draft/generate-draft
    POST /api/knowledge-to-draft/generate-draft HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Content-Type: application/json
    Accept: */*
    Content-Length: 136
    
    {
      "summary": {
        "title": "text",
        "description": "text",
        "chapters": [
          {
            "title": "text",
            "script": "text",
            "searchTerms": "text"
          }
        ]
      },
      "templateId": "text"
    }

    Generate a draft

    Responses
    200

    Job created successfully

    application/json
    idstringOptional

    The ID of the created video-generation-job.

    videoIdstringOptional

    The provisioned ID of the generated video.

    400

    Bad Request. Invalid input data.

    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    post/video-generation-jobs/template-jobs
    POST /api/v1/video-generation-jobs/template-jobs HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Content-Type: application/json
    Accept: */*
    Content-Length: 88
    
    {
      "templateJobId": "text",
      "dynamicVariables": {},
      "callbackUrl": "text",
      "callbackPayload": {}
    }

    Create a video generation job from a template

    dynamicVariablesobjectOptional

    Dynamic variables to be loaded into the template.

    callbackstringOptional

    URL for the callback.

    callbackPayloadobjectOptional

    Payload for the callback.

    widthnumberRequired

    Width of the video in pixels

    heightnumberRequired

    Height of the video in pixels

    namestringRequired

    Name of the video

    alphaChannelbooleanOptional

    Whether to attach the alpha-channel to the video or not. If true, only single-scene videos containing a single actor tracks are supported.

    namestringOptional

    Name of the scene.

    durationnumberOptional

    Duration of the scene.

    intrinsicDurationTrackReferencestringOptional

    Reference to a track's referenceId in the scene. If the referenced track has a duration, the scene duration will be the same as the track duration.

    typestring · enumRequired

    Type of transition.

    Possible values:
    durationnumberRequired

    Duration of the transition.

    typestring · enumOptional

    Type of track.

    Possible values:
    variantstring · enumOptionalPossible values:
    viewstring · enumOptionalPossible values:
    actorstringOptional

    The name field of the actor

    audioUrlstringOptional

    A publicly accessible URL to the audio file that the actor will speak.

    textstringOptional

    Text content for the actor to speak.

    speakerIdstringOptional

    The name of the voice to use.

    xnumberOptional

    X-coordinate of the track.

    ynumberOptional

    Y-coordinate of the track.

    widthnumberOptional

    Width of the track in pixels.

    heightnumberOptional

    Height of the pixels.

    rotationnumberOptional

    Rotation of the track. (-359 to 359)

    referenceIdstringOptional

    Reference ID of the track.

    or
    typestring · enumOptional

    Type of track.

    Possible values:
    imageUrlstringOptional

    A publicly accessible URL to the image file.

    xnumberOptional

    X-coordinate of the track.

    ynumberOptional

    Y-coordinate of the track.

    widthnumberOptional

    Width of the track in pixels.

    heightnumberOptional

    Height of the pixels.

    rotationnumberOptional

    Rotation of the track. (-359 to 359)

    referenceIdstringOptional

    Reference ID of the track.

    or
    typestring · enumOptional

    Type of track.

    Possible values:
    videoUrlstringOptional

    URL of the video.

    volumenumberOptional

    Volume level.

    xnumberOptional

    X-coordinate of the track.

    ynumberOptional

    Y-coordinate of the track.

    widthnumberOptional

    Width of the track in pixels.

    heightnumberOptional

    Height of the pixels.

    rotationnumberOptional

    Rotation of the track. (-359 to 359)

    referenceIdstringOptional

    Reference ID of the track.

    or
    typestring · enumOptional

    Type of track.

    Possible values:
    textstringOptional

    Text content.

    fontSizenumberOptional

    Font size.

    colorstringOptional

    Color of the text.

    backgroundColorstringOptional

    Background color of the text.

    xnumberOptional

    X-coordinate of the track.

    ynumberOptional

    Y-coordinate of the track.

    widthnumberOptional

    Width of the track in pixels.

    heightnumberOptional

    Height of the pixels.

    rotationnumberOptional

    Rotation of the track. (-359 to 359)

    referenceIdstringOptional

    Reference ID of the track.

    or
    typestring · enumOptional

    Type of track.

    Possible values:
    audioUrlstringOptional

    A publicly accessible URL to the audio file.

    volumenumberOptional

    Volume level for the audio track.

    Responses
    200

    Job created successfully

    application/json
    idstringOptional

    The ID of the created job.

    videoIdstringOptional

    The provisioned ID of the generated video.

    400

    Bad Request. Invalid input data.

    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    post/video-generation-jobs
    POST /api/v1/video-generation-jobs HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Content-Type: application/json
    Accept: */*
    Content-Length: 505
    
    {
      "dynamicVariables": {},
      "callback": "text",
      "callbackPayload": {},
      "videoCreative": {
        "settings": {
          "videoSize": {
            "width": 1,
            "height": 1
          },
          "name": "text",
          "alphaChannel": true
        },
        "scenes": [
          {
            "name": "text",
            "duration": 1,
            "intrinsicDurationTrackReference": "text",
            "transition": {
              "type": "fade",
              "duration": 1
            },
            "tracks": [
              {
                "type": "actor",
                "variant": "full_body",
                "view": "front",
                "actor": "text",
                "audioUrl": "text",
                "text": "text",
                "speakerId": "text",
                "position": {
                  "x": 1,
                  "y": 1
                },
                "size": {
                  "width": 1,
                  "height": 1
                },
                "rotation": 1,
                "referenceId": "text"
              }
            ]
          }
        ]
      }
    }

    Create a video generation job

    Get video generation job status

    get

    Retrieves the status, progress, and details of a specific video generation job.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoIdstringRequired

    The ID of the video generation job.

    Responses
    200

    Video generation job status and details retrieved successfully. The return value also contains all of the video-generation-job itself, which is not detailed here.

    application/json
    statusstring · enumOptional

    The current status of the video generation job.

    Possible values:
    videoIdstringOptional

    The ID of the provisioned generated video

    progressnumberOptional

    The current progress of the video generation job.

    maximumProgressnumberOptional

    The maximum progress value of the video generation job.

    400

    Bad Request. Invalid videoId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Video generation job not found.

    500

    Internal server error.

    No content

    get/video-generation-jobs/{videoId}

    Get details of a generated video

    get

    Retrieves detailed information about a specific generated video.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoIdstringRequired

    The ID of the generated video.

    Responses
    200

    Generated video details retrieved successfully

    application/json
    idstringOptional

    The unique identifier of the generated video.

    jobIdstringOptional

    The ID of the video generation job that created this video.

    publicUrlstringOptional

    The public URL of the generated video.

    thumbnailUrlstringOptional

    URL of the video's thumbnail.

    namestringOptional

    The name of the video.

    createdAtstring · date-timeOptional

    The date and time the video was created.

    videoSizeBytesnumber · nullableOptional

    The size of the video in bytes.

    videoDurationSecondsnumber · nullableOptional

    The duration of the video in seconds.

    400

    Bad Request. Invalid videoId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Generated video not found.

    500

    Internal server error.

    No content

    get/generated-videos/{videoId}
    {
      "url": "text"
    }
    {
      "id": "text",
      "videoId": "text"
    }
    {
      "id": "text",
      "videoId": "text"
    }
    {
      "status": "in_queue",
      "videoId": "text",
      "progress": 1,
      "maximumProgress": 1
    }
    {
      "id": "text",
      "jobId": "text",
      "publicUrl": "text",
      "thumbnailUrl": "text",
      "name": "text",
      "createdAt": "2026-07-04T16:05:55.995Z",
      "videoSizeBytes": 1,
      "videoDurationSeconds": 1
    }
    GET /api/v1/video-generation-jobs/{videoId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    GET /api/v1/generated-videos/{videoId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    

    Retrieve a list of avatars

    get

    Fetches a list of avatars that the user has access to.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Responses
    200

    A list of avatars

    application/json
    namestringRequired

    Key in the database

    display_namestringRequired

    Display name of the avatar

    preview_urlstringRequired

    URL to the avatar's preview video

    versionstring · enumRequiredPossible values:
    typestring · enumOptionalPossible values:
    genderstring · enumRequiredPossible values:
    ageintegerOptional

    Age of the avatar

    ethnicitystring · enumOptionalPossible values:
    clothing_stylestring · enumOptionalPossible values:
    qualitystring · enumOptionalPossible values:
    emotionsstring[]Optional

    Deprecated field, will be removed in future versions.

    default_voicestringRequired

    Default voice for the avatar

    viewVariantTypestring · enumOptionalPossible values:
    heightnumberOptional
    widthnumberOptional
    rationumberOptional
    itemsstring · enumOptionalPossible values:
    itemsstring · enumOptionalPossible values:
    pictarSourceUrlstringOptional
    thumbnailstringOptional
    editorstringOptional
    editor-happystringOptional
    editor-sadstringOptional
    basestringOptional
    baseTransparentstringOptional
    dynamicPreviewUrlstringOptional
    emotionsstring[]OptionalDeprecated
    leftnumberOptional

    The percentage of the image to crop from the left (0-1)

    topnumberOptional

    The percentage of the image to crop from the top (0-1)

    rightnumberOptional

    The percentage of the image to crop from the right (0-1)

    bottomnumberOptional

    The percentage of the image to crop from the bottom (0-1)

    leftnumberOptional

    The percentage of the image to crop from the left (0-1)

    topnumberOptional

    The percentage of the image to crop from the top (0-1)

    rightnumberOptional

    The percentage of the image to crop from the right (0-1)

    bottomnumberOptional

    The percentage of the image to crop from the bottom (0-1)

    side_viewbooleanOptional
    emotionsbooleanOptional
    hand_gesturebooleanOptional
    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    get/assets/actors
    delete

    Deletes a specific video generation job and stops any ongoing processing.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoGenerationJobIdstringRequired

    The ID of the video generation job.

    Responses
    200

    Video generation job status and details retrieved successfully. The return value also contains all of the video-generation-job itself, which is not detailed here.

    No content

    400

    Bad Request. Invalid videoGenerationJobId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Video generation job not found.

    500

    Internal server error.

    No content

    delete
    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoIdstringRequired

    The ID of the generated video.

    Responses
    200

    Video deleted successfully

    application/json
    anyOptional
    400

    Bad Request. Invalid videoId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Generated video not found.

    500

    Internal server error.

    No content

    post

    Creates a new avatar from the sent properties.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Body
    displayNamestringOptional

    Name to be displayed for the avatar

    sourceFileUrlstringOptional

    URL to the video or image file for the avatar to be created from

    genderstring · enumOptionalPossible values:
    get

    Retrieves detailed information about a specific generated video.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoIdstringRequired

    The ID of the generated video.

    Responses
    200

    Generated video details retrieved successfully

    application/json
    idstringOptional

    The unique identifier of the generated video.

    jobIdstringOptional

    The ID of the video generation job that created this video.

    publicUrlstringOptional

    The public URL of the generated video.

    thumbnailUrlstringOptional

    URL of the video's thumbnail.

    namestringOptional

    The name of the video.

    createdAtstring · date-timeOptional

    The date and time the video was created.

    videoSizeBytesnumber · nullableOptional

    The size of the video in bytes.

    videoDurationSecondsnumber · nullableOptional

    The duration of the video in seconds.

    400

    Bad Request. Invalid videoId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Generated video not found.

    500

    Internal server error.

    No content

    get

    Retrieves the status, progress, and details of a specific video generation job.

    Authorizations
    AuthorizationstringRequired
    Bearer authentication header of the form Bearer <token>.
    Path parameters
    videoGenerationJobIdstringRequired

    The ID of the video generation job.

    Responses
    200

    Video generation job status and details retrieved successfully. The return value also contains all of the video-generation-job itself, which is not detailed here.

    application/json
    statusstring · enumOptional

    The current status of the video generation job.

    Possible values:
    videoIdstringOptional

    The ID of the provisioned generated video

    progressnumberOptional

    The current progress of the video generation job.

    maximumProgressnumberOptional

    The maximum progress value of the video generation job.

    400

    Bad Request. Invalid videoGenerationJobId parameter.

    401

    Unauthorized. Authentication required.

    No content

    404

    Not Found. Video generation job not found.

    500

    Internal server error.

    No content

    delete/video-generation-jobs/{videoGenerationJobId}

    No content

    Delete a video generation job

    delete/generated-videos/{videoId}

    No content

    Delete a generated video

    get/generated-videos/{videoId}

    Get details of a generated video

    DELETE /api/v1/video-generation-jobs/{videoGenerationJobId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    DELETE /api/v1/generated-videos/{videoId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    GET /api/v1/generated-videos/{videoId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    {
      "id": "text",
      "jobId": "text",
      "publicUrl": "text",
      "thumbnailUrl": "text",
      "name": "text",
      "createdAt": "2026-07-04T16:05:55.995Z",
      "videoSizeBytes": 1,
      "videoDurationSeconds": 1
    }
    GET /api/v1/assets/actors HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    
    [
      {
        "name": "text",
        "display_name": "text",
        "preview_url": "text",
        "version": "v1",
        "type": "Studio",
        "gender": "Male",
        "age": 1,
        "ethnicity": "Caucasian",
        "clothing_style": "Casual",
        "quality": "lite",
        "emotions": [
          "text"
        ],
        "default_voice": "text",
        "views": [
          {
            "viewVariantType": "front",
            "aspectRatio": {
              "height": 1,
              "width": 1,
              "ratio": 1
            },
            "supportedActorTrackVariants": [
              "full_body"
            ],
            "supportedInferenceModels": [
              "legacy"
            ],
            "pictarSourceUrl": "text",
            "images": {
              "thumbnail": "text",
              "editor": "text",
              "editor-happy": "text",
              "editor-sad": "text"
            },
            "videos": {
              "base": "text",
              "baseTransparent": "text",
              "dynamicPreviewUrl": "text"
            },
            "bubbleVariantDefaults": {
              "crop": {
                "left": 1,
                "top": 1,
                "right": 1,
                "bottom": 1
              }
            },
            "shoulderVariantDefaults": {
              "crop": {
                "left": 1,
                "top": 1,
                "right": 1,
                "bottom": 1
              }
            }
          }
        ],
        "features": {
          "side_view": true,
          "emotions": true,
          "hand_gesture": true
        }
      }
    ]
    Responses
    201

    Avatar created successfully.

    application/json
    400

    Bad Request. Invalid input data.

    401

    Unauthorized. Authentication required.

    No content

    500

    Internal server error.

    No content

    post/assets/actors
    POST /api/v1/assets/actors HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Content-Type: application/json
    Accept: */*
    Content-Length: 61
    
    {
      "displayName": "text",
      "sourceFileUrl": "text",
      "gender": "Male"
    }

    Creates a new avatar.

    get/video-generation-jobs/{videoGenerationJobId}
    GET /api/v1/video-generation-jobs/{videoGenerationJobId} HTTP/1.1
    Host: app.colossyan.com
    Authorization: Bearer YOUR_SECRET_TOKEN
    Accept: */*
    

    Get video generation job status

    {
      "name": "text"
    }
    {
      "status": "in_queue",
      "videoId": "text",
      "progress": 1,
      "maximumProgress": 1
    }