Skip to content

com.atproto.sync.subscribeRepos

Lexicon Version: 1

Type: subscription

Repository event stream, aka Firehose endpoint. Outputs repo commits with diff data, and identity update events, for all repositories on the current server. See the atproto specifications for details around stream sequencing, repo versioning, CAR diff format, and more. Public and does not require auth; implemented by PDS and Relay.

Parameters:

NameTypeReq’dDescriptionConstraints
cursorintegerThe last known event seq number to backfill from.
Message:
  • Schema:

Schema Type: Union of:
  #commit
  #sync
  #identity
  #account
  #info

Possible Errors:

  • FutureCursor
  • ConsumerTooSlow: If the consumer of the stream can not keep up with events, and a backlog gets too large, the server will drop the connection.

Type: object

Represents an update of repository state. Note that empty commits are allowed, which include no repo data changes, but an update to rev and signature.

Properties:

NameTypeReq’dNullableDescriptionConstraints
seqintegern/aThe stream sequence number of this message.
rebasebooleann/aDEPRECATED — unused
tooBigbooleann/aDEPRECATED — replaced by #sync event and data limits. Indicates that this commit contained too many ops, or data size was too large. Consumers will need to make a separate request to get missing data.
repostringn/aThe repo this event comes from. Note that all other message types name this field ‘did’.Format: did
commitcid-linkn/aRepo commit object CID.
revstringn/aThe rev of the emitted commit. Note that this information is also in the commit object included in blocks, unless this is a tooBig event.Format: tid
sincestringThe rev of the last emitted commit from this repo (if any).Format: tid
blocksbytesn/aCAR file containing relevant blocks, as a diff since the previous repo state. The commit must be included as a block, and the commit block CID must be the first entry in the CAR header ‘roots’ list.Max Length: 2000000
opsArray of #repoOpn/aMax Items: 200
blobsArray of cid-linkn/a
prevDatacid-linkn/aThe root CID of the MST tree for the previous commit from this repo (indicated by the ‘since’ revision field in this message). Corresponds to the ‘data’ field in the repo commit object. NOTE: this field is effectively required for the ‘inductive’ version of firehose.
timestringn/aTimestamp of when this message was originally broadcast.Format: datetime

Type: object

Updates the repo to a new state, without necessarily including that state on the firehose. Used to recover from broken commit streams, data loss incidents, or in situations where upstream host does not know recent state of the repository.

Properties:

NameTypeReq’dDescriptionConstraints
seqintegerThe stream sequence number of this message.
didstringThe account this repo event corresponds to. Must match that in the commit object.Format: did
blocksbytesCAR file containing the commit, as a block. The CAR header must include the commit block CID as the first ‘root’.Max Length: 10000
revstringThe rev of the commit. This value must match that in the commit object.
timestringTimestamp of when this message was originally broadcast.Format: datetime

Type: object

Represents a change to an account’s identity. Could be an updated handle, signing key, or pds hosting endpoint. Serves as a prod to all downstream services to refresh their identity cache.

Properties:

NameTypeReq’dDescriptionConstraints
seqinteger
didstringFormat: did
timestringFormat: datetime
handlestringThe current handle for the account, or ‘handle.invalid’ if validation fails. This field is optional, might have been validated or passed-through from an upstream source. Semantics and behaviors for PDS vs Relay may evolve in the future; see atproto specs for more details.Format: handle

Type: object

Represents a change to an account’s status on a host (eg, PDS or Relay). The semantics of this event are that the status is at the host which emitted the event, not necessarily that at the currently active PDS. Eg, a Relay takedown would emit a takedown with active=false, even if the PDS is still active.

Properties:

NameTypeReq’dDescriptionConstraints
seqinteger
didstringFormat: did
timestringFormat: datetime
activebooleanIndicates that the account has a repository which can be fetched from the host that emitted this event.
statusstringIf active=false, this optional field indicates a reason for why the account is not active.Known Values: takendown, suspended, deleted, deactivated, desynchronized, throttled

Type: object

Properties:

NameTypeReq’dDescriptionConstraints
namestringKnown Values: OutdatedCursor
messagestring

Type: object

A repo operation, ie a mutation of a single record.

Properties:

NameTypeReq’dNullableDescriptionConstraints
actionstringn/aKnown Values: create, update, delete
pathstringn/a
cidcid-linkFor creates and updates, the new record CID. For deletions, null.
prevcid-linkn/aFor updates and deletes, the previous record CID (required for inductive firehose). For creations, field should not be defined.

{
"lexicon": 1,
"id": "com.atproto.sync.subscribeRepos",
"defs": {
"main": {
"type": "subscription",
"description": "Repository event stream, aka Firehose endpoint. Outputs repo commits with diff data, and identity update events, for all repositories on the current server. See the atproto specifications for details around stream sequencing, repo versioning, CAR diff format, and more. Public and does not require auth; implemented by PDS and Relay.",
"parameters": {
"type": "params",
"properties": {
"cursor": {
"type": "integer",
"description": "The last known event seq number to backfill from."
}
}
},
"message": {
"schema": {
"type": "union",
"refs": [
"#commit",
"#sync",
"#identity",
"#account",
"#info"
]
}
},
"errors": [
{
"name": "FutureCursor"
},
{
"name": "ConsumerTooSlow",
"description": "If the consumer of the stream can not keep up with events, and a backlog gets too large, the server will drop the connection."
}
]
},
"commit": {
"type": "object",
"description": "Represents an update of repository state. Note that empty commits are allowed, which include no repo data changes, but an update to rev and signature.",
"required": [
"seq",
"rebase",
"tooBig",
"repo",
"commit",
"rev",
"since",
"blocks",
"ops",
"blobs",
"time"
],
"nullable": [
"since"
],
"properties": {
"seq": {
"type": "integer",
"description": "The stream sequence number of this message."
},
"rebase": {
"type": "boolean",
"description": "DEPRECATED -- unused"
},
"tooBig": {
"type": "boolean",
"description": "DEPRECATED -- replaced by #sync event and data limits. Indicates that this commit contained too many ops, or data size was too large. Consumers will need to make a separate request to get missing data."
},
"repo": {
"type": "string",
"format": "did",
"description": "The repo this event comes from. Note that all other message types name this field 'did'."
},
"commit": {
"type": "cid-link",
"description": "Repo commit object CID."
},
"rev": {
"type": "string",
"format": "tid",
"description": "The rev of the emitted commit. Note that this information is also in the commit object included in blocks, unless this is a tooBig event."
},
"since": {
"type": "string",
"format": "tid",
"description": "The rev of the last emitted commit from this repo (if any)."
},
"blocks": {
"type": "bytes",
"description": "CAR file containing relevant blocks, as a diff since the previous repo state. The commit must be included as a block, and the commit block CID must be the first entry in the CAR header 'roots' list.",
"maxLength": 2000000
},
"ops": {
"type": "array",
"items": {
"type": "ref",
"ref": "#repoOp",
"description": "List of repo mutation operations in this commit (eg, records created, updated, or deleted)."
},
"maxLength": 200
},
"blobs": {
"type": "array",
"items": {
"type": "cid-link",
"description": "DEPRECATED -- will soon always be empty. List of new blobs (by CID) referenced by records in this commit."
}
},
"prevData": {
"type": "cid-link",
"description": "The root CID of the MST tree for the previous commit from this repo (indicated by the 'since' revision field in this message). Corresponds to the 'data' field in the repo commit object. NOTE: this field is effectively required for the 'inductive' version of firehose."
},
"time": {
"type": "string",
"format": "datetime",
"description": "Timestamp of when this message was originally broadcast."
}
}
},
"sync": {
"type": "object",
"description": "Updates the repo to a new state, without necessarily including that state on the firehose. Used to recover from broken commit streams, data loss incidents, or in situations where upstream host does not know recent state of the repository.",
"required": [
"seq",
"did",
"blocks",
"rev",
"time"
],
"properties": {
"seq": {
"type": "integer",
"description": "The stream sequence number of this message."
},
"did": {
"type": "string",
"format": "did",
"description": "The account this repo event corresponds to. Must match that in the commit object."
},
"blocks": {
"type": "bytes",
"description": "CAR file containing the commit, as a block. The CAR header must include the commit block CID as the first 'root'.",
"maxLength": 10000
},
"rev": {
"type": "string",
"description": "The rev of the commit. This value must match that in the commit object."
},
"time": {
"type": "string",
"format": "datetime",
"description": "Timestamp of when this message was originally broadcast."
}
}
},
"identity": {
"type": "object",
"description": "Represents a change to an account's identity. Could be an updated handle, signing key, or pds hosting endpoint. Serves as a prod to all downstream services to refresh their identity cache.",
"required": [
"seq",
"did",
"time"
],
"properties": {
"seq": {
"type": "integer"
},
"did": {
"type": "string",
"format": "did"
},
"time": {
"type": "string",
"format": "datetime"
},
"handle": {
"type": "string",
"format": "handle",
"description": "The current handle for the account, or 'handle.invalid' if validation fails. This field is optional, might have been validated or passed-through from an upstream source. Semantics and behaviors for PDS vs Relay may evolve in the future; see atproto specs for more details."
}
}
},
"account": {
"type": "object",
"description": "Represents a change to an account's status on a host (eg, PDS or Relay). The semantics of this event are that the status is at the host which emitted the event, not necessarily that at the currently active PDS. Eg, a Relay takedown would emit a takedown with active=false, even if the PDS is still active.",
"required": [
"seq",
"did",
"time",
"active"
],
"properties": {
"seq": {
"type": "integer"
},
"did": {
"type": "string",
"format": "did"
},
"time": {
"type": "string",
"format": "datetime"
},
"active": {
"type": "boolean",
"description": "Indicates that the account has a repository which can be fetched from the host that emitted this event."
},
"status": {
"type": "string",
"description": "If active=false, this optional field indicates a reason for why the account is not active.",
"knownValues": [
"takendown",
"suspended",
"deleted",
"deactivated",
"desynchronized",
"throttled"
]
}
}
},
"info": {
"type": "object",
"required": [
"name"
],
"properties": {
"name": {
"type": "string",
"knownValues": [
"OutdatedCursor"
]
},
"message": {
"type": "string"
}
}
},
"repoOp": {
"type": "object",
"description": "A repo operation, ie a mutation of a single record.",
"required": [
"action",
"path",
"cid"
],
"nullable": [
"cid"
],
"properties": {
"action": {
"type": "string",
"knownValues": [
"create",
"update",
"delete"
]
},
"path": {
"type": "string"
},
"cid": {
"type": "cid-link",
"description": "For creates and updates, the new record CID. For deletions, null."
},
"prev": {
"type": "cid-link",
"description": "For updates and deletes, the previous record CID (required for inductive firehose). For creations, field should not be defined."
}
}
}
}
}