Read documents

curl --request POST \
  --url https://{domain}-be.glean.com/rest/api/v1/getdocuments \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "documentSpecs": [
    {
      "url": "<string>"
    }
  ],
  "includeFields": [
    "LAST_VIEWED_AT"
  ]
}'

{
  "documents": {}
}
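The curl call above can also be assembled programmatically. The following is a minimal sketch using only the Python standard library; the function name `build_getdocuments_request` and its parameters are illustrative assumptions, not part of the Glean API, and the request is only constructed here, not sent.

```python
import json
import urllib.request


def build_getdocuments_request(domain: str, token: str, urls: list[str]) -> urllib.request.Request:
    """Build (but do not send) a POST request mirroring the curl example."""
    body = {
        # One documentSpec per URL, as in the curl example.
        "documentSpecs": [{"url": u} for u in urls],
        "includeFields": ["LAST_VIEWED_AT"],
    }
    return urllib.request.Request(
        url=f"https://{domain}-be.glean.com/rest/api/v1/getdocuments",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` and decoding the body with `json.load` would yield the `documents` map shown in the sample response.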

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Headers

X-Scio-Actas

string

Email address of a user on whose behalf the request is intended to be made (should be non-empty only for global tokens).

X-Glean-Auth-Type

string

Auth type being used to access the endpoint (should be non-empty only for global tokens).
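Since both optional headers should be non-empty only for global tokens, a client might add them conditionally. This is a sketch under that reading; `build_headers` and its parameter names are hypothetical, and the accepted values for `X-Glean-Auth-Type` are not listed on this page, so the value is passed through unchanged.

```python
from typing import Optional


def build_headers(
    token: str,
    act_as_email: Optional[str] = None,
    auth_type: Optional[str] = None,
) -> dict[str, str]:
    """Return request headers, adding the global-token headers only when given."""
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    if act_as_email:
        # Only meaningful for global tokens, per the header description.
        headers["X-Scio-Actas"] = act_as_email
    if auth_type:
        # Value is caller-supplied; valid values are not documented here.
        headers["X-Glean-Auth-Type"] = auth_type
    return headers
```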

Body

application/json

Information about documents requested.

documentSpecs

object[]

required

The specification for the documents to be retrieved.

includeFields

enum<string>[]

List of Document fields to return (that aren't returned by default)

Available options:

LAST_VIEWED_AT,

VISITORS_COUNT,

RECENT_SHARES,

DOCUMENT_CONTENT
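Because `includeFields` is an enum list, it can be worth rejecting unknown values before the request leaves the client. The sketch below assumes URL-based document specs only (the one form shown on this page); `build_body` is an illustrative helper name.

```python
ALLOWED_INCLUDE_FIELDS = frozenset(
    {"LAST_VIEWED_AT", "VISITORS_COUNT", "RECENT_SHARES", "DOCUMENT_CONTENT"}
)


def build_body(urls: list[str], include_fields: tuple[str, ...] = ()) -> dict:
    """Build the JSON body, validating includeFields against the listed options."""
    unknown = sorted(set(include_fields) - ALLOWED_INCLUDE_FIELDS)
    if unknown:
        raise ValueError(f"Unsupported includeFields values: {unknown}")
    body: dict = {"documentSpecs": [{"url": u} for u in urls]}
    if include_fields:
        # includeFields is optional, so it is omitted when empty.
        body["includeFields"] = list(include_fields)
    return body
```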

Response

200

application/json

documents

object

The document details, or the error if the document is not found.

documents.{key}

object

documents.{key}.id

string

The Glean Document ID.

documents.{key}.datasource

string

The app or other repository type from which the document was extracted

documents.{key}.connectorType

enum<string>

The source from which document content was pulled, e.g. an API crawl or browser history

Available options:

API_CRAWL,

BROWSER_CRAWL,

BROWSER_HISTORY,

BUILTIN,

FEDERATED_SEARCH,

PUSH_API,

WEB_CRAWL,

NATIVE_HISTORY

documents.{key}.docType

string

The datasource-specific type of the document (e.g. for Jira issues, this is the issue type such as Bug or Feature Request).

documents.{key}.content

object

documents.{key}.containerDocument

object

documents.{key}.parentDocument

object

documents.{key}.title

string

The title of the document.

documents.{key}.url

string

A permalink for the document.
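The top-level fields of each entry in the `documents` map can be collected into a compact summary. This sketch assumes the response has already been decoded into a Python dict; `summarize_documents` is a hypothetical helper. The shape of a not-found error entry is not shown on this page, so any missing field simply comes back as `None`.

```python
def summarize_documents(response: dict) -> list[dict]:
    """Collect id, title, url and datasource for every entry in `documents`."""
    summaries = []
    for key, entry in response.get("documents", {}).items():
        summaries.append(
            {
                "key": key,
                # .get() tolerates entries that lack these fields,
                # for example an entry describing an error.
                "id": entry.get("id"),
                "title": entry.get("title"),
                "url": entry.get("url"),
                "datasource": entry.get("datasource"),
            }
        )
    return summaries
```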

documents.{key}.metadata

object

documents.{key}.metadata.datasource

string

documents.{key}.metadata.datasourceInstance

string

The datasource instance from which the document was extracted.

documents.{key}.metadata.objectType

string

The type of the result. Interpretation is specific to each datasource. (e.g. for Jira issues, this is the issue type such as Bug or Feature Request).

documents.{key}.metadata.container

string

The name of the container (higher level parent, not direct parent) of the result. Interpretation is specific to each datasource (e.g. Channels for Slack, Project for Jira). cf. parentId

documents.{key}.metadata.containerId

string

The Glean Document ID of the container. Uniquely identifies the container.

documents.{key}.metadata.superContainerId

string

The Glean Document ID of the super container. Super container represents a broader abstraction that contains many containers. For example, whereas container might refer to a folder, super container would refer to a drive.

documents.{key}.metadata.parentId

string

The id of the direct parent of the result. Interpretation is specific to each datasource (e.g. parent issue for Jira). cf. container

documents.{key}.metadata.mimeType

string

documents.{key}.metadata.documentId

string

The index-wide unique identifier.

documents.{key}.metadata.loggingId

string

A unique identifier used to represent the document in any logging or feedback requests in place of documentId.

documents.{key}.metadata.documentIdHash

string

Hash of the Glean Document ID.

documents.{key}.metadata.createTime

string

documents.{key}.metadata.updateTime

string

documents.{key}.metadata.author

object

documents.{key}.metadata.author.name

string

required

The display name.

documents.{key}.metadata.author.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.author.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}
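The person object recurs throughout the response (`author`, `owner`, `assignedTo`, and others), so a single typed wrapper can serve all of them. A minimal sketch, assuming only the fields documented above; the `Person` class and `parse_person` function are illustrative names.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class Person:
    name: str            # required: the display name
    obfuscated_id: str   # required: opaque identifier for the person
    metadata: dict[str, Any] = field(default_factory=dict)


def parse_person(raw: dict) -> Person:
    """Convert a raw person object into a Person; raises KeyError if a required field is absent."""
    return Person(
        name=raw["name"],
        obfuscated_id=raw["obfuscatedId"],
        metadata=raw.get("metadata") or {},
    )
```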

documents.{key}.metadata.owner

object

documents.{key}.metadata.owner.name

string

required

The display name.

documents.{key}.metadata.owner.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.owner.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.mentionedPeople

object[]

A list of people mentioned in the document.

documents.{key}.metadata.mentionedPeople.name

string

required

The display name.

documents.{key}.metadata.mentionedPeople.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.mentionedPeople.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

documents.{key}.metadata.visibility

enum<string>

The level of visibility of the document as understood by our system.

Available options:

PRIVATE,

SPECIFIC_PEOPLE_AND_GROUPS,

DOMAIN_LINK,

DOMAIN_VISIBLE,

PUBLIC_LINK,

PUBLIC_VISIBLE
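The visibility options map naturally onto an enumeration. The sketch below is one way a client might parse the field defensively; `Visibility` and `parse_visibility` are hypothetical names, and returning `None` for a missing or unrecognized value is an assumption about how a client would want to behave, not documented API behavior.

```python
from enum import Enum
from typing import Optional


class Visibility(str, Enum):
    PRIVATE = "PRIVATE"
    SPECIFIC_PEOPLE_AND_GROUPS = "SPECIFIC_PEOPLE_AND_GROUPS"
    DOMAIN_LINK = "DOMAIN_LINK"
    DOMAIN_VISIBLE = "DOMAIN_VISIBLE"
    PUBLIC_LINK = "PUBLIC_LINK"
    PUBLIC_VISIBLE = "PUBLIC_VISIBLE"


def parse_visibility(metadata: dict) -> Optional[Visibility]:
    """Return the document's visibility, or None if absent or not a listed option."""
    raw = metadata.get("visibility")
    try:
        return Visibility(raw)
    except ValueError:
        # Covers both a missing field (None) and an unknown string.
        return None
```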

documents.{key}.metadata.components

string[]

A list of components this result is associated with. Interpretation is specific to each datasource. (e.g. for Jira issues, these are components.)

documents.{key}.metadata.status

string

The status or disposition of the result. Interpretation is specific to each datasource. (e.g. for Jira issues, this is the issue status such as Done, In Progress or Will Not Fix).

documents.{key}.metadata.statusCategory

string

The status category of the result. Meant to be more general than status. Interpretation is specific to each datasource.

documents.{key}.metadata.pins

object[]

A list of stars associated with this result. "Pin" is an older name.

documents.{key}.metadata.pins.documentId

string

required

The document which should be a pinned result.

documents.{key}.metadata.pins.id

string

The opaque id of the pin.

documents.{key}.metadata.pins.audienceFilters

object[]

Filters which restrict who should see the pinned document. Values are taken from the corresponding filters in people search.

documents.{key}.metadata.pins.attribution

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.pins.updatedBy

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.pins.createTime

string

documents.{key}.metadata.pins.updateTime

string

documents.{key}.metadata.pins.queries

string[]

The query strings for which the pinned result will show.

documents.{key}.metadata.priority

string

The document priority. Interpretation is datasource specific.

documents.{key}.metadata.assignedTo

object

documents.{key}.metadata.assignedTo.name

string

required

The display name.

documents.{key}.metadata.assignedTo.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.assignedTo.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.updatedBy

object

documents.{key}.metadata.updatedBy.name

string

required

The display name.

documents.{key}.metadata.updatedBy.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.updatedBy.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.labels

string[]

A list of tags for the document. Interpretation is datasource specific.

documents.{key}.metadata.collections

object[]

A list of collections that the document belongs to.

documents.{key}.metadata.collections.id

integer

required

The unique ID of the Collection.

documents.{key}.metadata.collections.name

string

required

The unique name of the Collection.

documents.{key}.metadata.collections.description

string

required

A brief summary of the Collection's contents.

documents.{key}.metadata.collections.icon

string

The emoji icon of this Collection.

documents.{key}.metadata.collections.adminLocked

boolean

Indicates whether edits are allowed for everyone or only admins.

documents.{key}.metadata.collections.parentId

integer

The parent of this Collection, or 0 if it's a top-level Collection.

documents.{key}.metadata.collections.thumbnail

object

documents.{key}.metadata.collections.allowedDatasource

string

The datasource type this Collection can hold.

documents.{key}.metadata.collections.createTime

string

documents.{key}.metadata.collections.updateTime

string

documents.{key}.metadata.collections.creator

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.collections.updatedBy

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.collections.itemCount

integer

The number of items currently in the Collection. Separated from the actual items so we can grab the count without items.

documents.{key}.metadata.collections.childCount

integer

The number of children Collections. Separated from the actual children so we can grab the count without children.

documents.{key}.metadata.collections.items

object[]

The items in this Collection.

documents.{key}.metadata.collections.pinMetadata

object

Metadata describing which categories this Collection is pinned to and which categories it is eligible to be pinned to.

documents.{key}.metadata.collections.shortcuts

string[]

The names of the shortcuts (Go Links) that point to this Collection.

documents.{key}.metadata.collections.children

object[]

The children Collections of this Collection.

documents.{key}.metadata.collections.roles

object[]

A list of user roles for the Collection.

documents.{key}.metadata.collections.addedRoles

object[]

A list of added user roles for the Collection.

documents.{key}.metadata.collections.removedRoles

object[]

A list of removed user roles for the Collection.

documents.{key}.metadata.collections.audienceFilters

object[]

Filters which restrict who should see this Collection. Values are taken from the corresponding filters in people search.

documents.{key}.metadata.collections.permissions

object

The permissions the current viewer has with respect to a particular object.
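As an illustration of the `parentId` convention for Collections (0 marks a top-level Collection), the following sketch lists the names of top-level Collections a document belongs to. `top_level_collection_names` is a hypothetical helper, and treating a Collection without a `parentId` as not top-level is an assumption made here for strictness.

```python
def top_level_collection_names(metadata: dict) -> list[str]:
    """Return names of the document's Collections whose parentId is exactly 0."""
    names = []
    for collection in metadata.get("collections", []):
        # parentId is 0 for a top-level Collection; entries that omit
        # parentId are skipped because their position is unknown.
        if collection.get("parentId") == 0:
            names.append(collection["name"])
    return names
```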

documents.{key}.metadata.datasourceId

string

The user-visible, datasource-specific ID (e.g. Salesforce case number, GitHub PR number).

documents.{key}.metadata.interactions

object

documents.{key}.metadata.verification

object

documents.{key}.metadata.viewerInfo

object

documents.{key}.metadata.permissions

object

documents.{key}.metadata.visitCount

object

documents.{key}.metadata.shortcuts

object[]

A list of shortcuts whose destination URL points to the document.

documents.{key}.metadata.shortcuts.inputAlias

string

required

Link text following go/ prefix as entered by the user.

documents.{key}.metadata.shortcuts.alias

string

Canonical link text following the go/ prefix, with hyphens and underscores removed.

documents.{key}.metadata.shortcuts.title

string

Title for the Go Link

documents.{key}.metadata.shortcuts.roles

object[]

A list of user roles for the Go Link.

documents.{key}.metadata.shortcuts.id

integer

The opaque id of the user generated content.

documents.{key}.metadata.shortcuts.destinationUrl

string

Destination URL for the shortcut.

documents.{key}.metadata.shortcuts.destinationDocumentId

string

Glean Document ID for the URL, if known.

documents.{key}.metadata.shortcuts.description

string

A short, plain text blurb to help people understand the intent of the shortcut.

documents.{key}.metadata.shortcuts.unlisted

boolean

Whether this shortcut is unlisted. Unlisted shortcuts are visible only to the author and admins.

documents.{key}.metadata.shortcuts.urlTemplate

string

For variable shortcuts, contains the URL template. Note that destinationUrl contains the default URL.

documents.{key}.metadata.shortcuts.addedRoles

object[]

A list of user roles added for the Shortcut.

documents.{key}.metadata.shortcuts.removedRoles

object[]

A list of user roles removed for the Shortcut.

documents.{key}.metadata.shortcuts.permissions

object

The permissions the current viewer has with respect to a particular object.

documents.{key}.metadata.shortcuts.createdBy

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.shortcuts.createTime

string

The time the shortcut was created in ISO format (ISO 8601).

documents.{key}.metadata.shortcuts.updatedBy

object

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.shortcuts.updateTime

string

The time the shortcut was updated in ISO format (ISO 8601).
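Timestamps such as `createTime` and `updateTime` are ISO 8601 strings, and the sample values on this page end in `Z`. A small parsing sketch follows; `parse_iso_time` is an illustrative name, and the `Z` replacement is there because `datetime.fromisoformat` only accepts a trailing `Z` from Python 3.11 onward.

```python
from datetime import datetime


def parse_iso_time(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as 2000-01-23T04:56:07.000Z."""
    if value.endswith("Z"):
        # Rewrite the UTC designator into an explicit offset for older Pythons.
        value = value[:-1] + "+00:00"
    return datetime.fromisoformat(value)
```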

documents.{key}.metadata.shortcuts.destinationDocument

object

Document that corresponds to the destination URL, if applicable.

documents.{key}.metadata.shortcuts.intermediateUrl

string

The URL from which the user is then redirected to the destination URL. Full replacement for https://go/<inputAlias>.

documents.{key}.metadata.shortcuts.viewPrefix

string

The part of the shortcut preceding the input alias when used for showing shortcuts to users. Should end with "/". e.g. "go/" for native shortcuts.
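The relationship between `inputAlias`, `alias`, and `viewPrefix` can be sketched as follows. Both helpers are hypothetical: the first applies only the transformation the field description states (removing hyphens and underscores), and the second assumes a default of `go/` when `viewPrefix` is absent, which this page documents only for native shortcuts.

```python
def canonical_alias(input_alias: str) -> str:
    """Remove hyphens and underscores, as described for the `alias` field."""
    return input_alias.replace("-", "").replace("_", "")


def shortcut_display_text(shortcut: dict) -> str:
    """Join viewPrefix and inputAlias into the text shown to users."""
    # Assumption: fall back to "go/" when the prefix is not present.
    prefix = shortcut.get("viewPrefix") or "go/"
    return prefix + shortcut["inputAlias"]
```

Whether the server applies further normalization to `alias` (for example case folding) is not stated here, so the first helper should not be treated as an exact reimplementation.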

documents.{key}.metadata.shortcuts.isExternal

boolean

Indicates whether a shortcut is native or external.

documents.{key}.metadata.shortcuts.editUrl

string

The URL at which the user can access the edit page of the shortcut.

documents.{key}.metadata.path

string

For file datasources such as OneDrive or GitHub, the path to the file.

documents.{key}.metadata.customData

object

Custom fields specific to individual datasources

documents.{key}.metadata.documentCategory

string

The document's document_category(.proto).

documents.{key}.metadata.contactPerson

object

documents.{key}.metadata.contactPerson.name

string

required

The display name.

documents.{key}.metadata.contactPerson.obfuscatedId

string

required

An opaque identifier that can be used to request metadata for a Person.

A list of documents related to this person.

documents.{key}.metadata.contactPerson.metadata

object

Example:

{
  "department": "Movies",
  "email": "george@example.com",
  "location": "Hollywood, CA",
  "phone": 6505551234,
  "photoUrl": "https://example.com/george.jpg",
  "startDate": "2000-01-23",
  "title": "Actor"
}

Example:

{
  "name": "George Clooney",
  "obfuscatedId": "abc123"
}

documents.{key}.metadata.thumbnail

object

A thumbnail image representing this document.

documents.{key}.metadata.indexStatus

object

documents.{key}.metadata.ancestors

object[]

A list of documents that are ancestors of this document in the hierarchy of the document's datasource, for example parent folders or containers. Ancestors can be of different types and some may not be indexed. Higher level ancestors appear earlier in the list.

documents.{key}.metadata.ancestors.id

string

The Glean Document ID.

documents.{key}.metadata.ancestors.datasource

string

The app or other repository type from which the document was extracted

documents.{key}.metadata.ancestors.connectorType

enum<string>

The source from which document content was pulled, e.g. an API crawl or browser history

Available options:

API_CRAWL,

BROWSER_CRAWL,

BROWSER_HISTORY,

BUILTIN,

FEDERATED_SEARCH,

PUSH_API,

WEB_CRAWL,

NATIVE_HISTORY

documents.{key}.metadata.ancestors.docType

string

The datasource-specific type of the document (e.g. for Jira issues, this is the issue type such as Bug or Feature Request).

documents.{key}.metadata.ancestors.content

object

documents.{key}.metadata.ancestors.containerDocument

object

documents.{key}.metadata.ancestors.parentDocument

object

documents.{key}.metadata.ancestors.title

string

The title of the document.

documents.{key}.metadata.ancestors.url

string

A permalink for the document.

documents.{key}.metadata.ancestors.metadata

object

Example:

{
  "container": "container",
  "parentId": "JIRA_EN-1337",
  "createTime": "2000-01-23T04:56:07.000Z",
  "datasource": "datasource",
  "author": { "name": "name" },
  "documentId": "documentId",
  "updateTime": "2000-01-23T04:56:07.000Z",
  "mimeType": "mimeType",
  "objectType": "Feature Request",
  "components": ["Backend", "Networking"],
  "status": ["Done"],
  "customData": { "someCustomField": "someCustomValue" }
}

documents.{key}.metadata.ancestors.sections

object[]

A list of content sub-sections in the document, e.g. text blocks with different headings in a Drive doc or Confluence page.

Example:

{
  "container": "container",
  "parentId": "JIRA_EN-1337",
  "createTime": "2000-01-23T04:56:07.000Z",
  "datasource": "datasource",
  "author": { "name": "name" },
  "documentId": "documentId",
  "updateTime": "2000-01-23T04:56:07.000Z",
  "mimeType": "mimeType",
  "objectType": "Feature Request",
  "components": ["Backend", "Networking"],
  "status": ["Done"],
  "customData": { "someCustomField": "someCustomValue" }
}
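Because higher-level ancestors appear earlier in the `ancestors` list, a breadcrumb can be built by joining entries in order. This is a sketch only: `ancestor_breadcrumb` is a hypothetical helper, and falling back to the ancestor's `id` or a placeholder is an assumption made to cope with ancestors that are not indexed and may lack a title.

```python
def ancestor_breadcrumb(metadata: dict, separator: str = " / ") -> str:
    """Join ancestor titles from highest level to lowest."""
    parts = []
    for ancestor in metadata.get("ancestors", []):
        # Prefer the title; fall back to the id, then to a placeholder.
        parts.append(ancestor.get("title") or ancestor.get("id") or "(unknown)")
    return separator.join(parts)
```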

documents.{key}.sections

object[]

A list of content sub-sections in the document, e.g. text blocks with different headings in a Drive doc or Confluence page.
