Skip to main content

Connectors Overview

Connectors are CoreCube's automated ingestion path. They sync documents from external systems into the evidence layer through scheduled runs and manual triggers.

The three ingestion paths

All knowledge enters CoreCube through one of three paths:

PathDescriptionSource tag
ConnectorsAutomated delta sync from external systemsconnector
LibraryManual document uploads (PDF, Markdown, DOCX, etc.)library
Check-insHuman knowledge via Slack, Teams, email, web form, or APIcheckin

All three converge into the same evidence layer with identical retrieval treatment. The source_path tag enables trust-aware ranking and scope filtering.

Available connectors

Purpose-built integrations with reliable incremental sync, deletion detection, and content normalization.

ConnectorSourceChange detection
ConfluenceAtlassian Cloud spaces and pagesPage version number
JiraAtlassian Cloud projects and issuesIssue updated timestamp
Local FilesA mounted local directoryScan time + file metadata

Each available connector provides:

  • Stable external document IDs
  • Incremental sync (only fetch changed documents)
  • Deletion detection (tombstone documents when upstream deletes)
  • Content normalization (HTML, ADF, PDF, DOCX → clean markdown)

Planned

Additional connectors are planned but not yet available: Notion, Microsoft 365, GitHub, GitLab, Bitbucket, Google Drive, Slack, HubSpot, and Nextcloud. An MCP-server connector — ingesting selected resources and read-only tools from an external MCP server — is coming soon.

Compartments and access control

Every connection belongs to exactly one compartment and has a sensitivity level. Together they form the connection's security label (e.g., hr/confidential).

Compartments

Admin-defined organizational boundaries — teams, departments, or functional areas:

executive rnd hr finance engineering legal all-staff

A connection's compartment cannot be changed after creation. To reclassify, delete and recreate the connection.

Sensitivity levels

LevelDescriptionExample
publicSafe for anyone in the organizationPublic docs, marketing
internalGeneral internal, not externally shareableTeam wikis, project docs
confidentialSensitive, restricted accessFinancial reports, HR records
restrictedHighly sensitive, need-to-knowM&A docs, legal matters

Source filtering

First-class connectors support filtering at connection setup so you can create narrow, compartment-appropriate connections instead of one broad connection:

✓ "Confluence — Engineering Docs" compartment: engineering sensitivity: internal
Space keys: ENG, DEVOPS

✓ "Confluence — HR Policies" compartment: hr sensitivity: confidential
Space keys: HR

✓ "Confluence — Company Handbook" compartment: all-staff sensitivity: public
Space keys: HANDBOOK

The connection form surfaces these filters prominently with a guidance message: "Select which parts of this source to ingest. Create separate connections for content with different sensitivity levels."

Source trust levels

Trust levelDescriptionExample sources
AuthoritativeOfficial, maintained documentationRunbooks, approved policies, official docs
ReferenceUseful context that may be informalWiki pages, shared notes, meeting summaries
VolatileRapidly changing or unverifiedChat exports, draft documents, ticket comments

Default: reference. Trust level is used as a tie-breaker in search ranking when chunk scores are equal.

Sync schedule

Each connection has a configurable sync interval (e.g., every 15 minutes, every hour, daily).

Manual sync: Click Sync Now in the connection detail view to trigger an immediate sync.

Connection health

The Admin Console shows real-time connection health:

StatusMeaning
HealthyLast sync completed within the expected window
DegradedLast sync had partial failures or is overdue
OfflineCannot reach the source or authentication failed

Connector metrics

Per-connection metrics available in the connection detail view:

MetricDescription
documents_foundTotal documents discovered during the sync
documents_newNewly ingested documents
documents_updatedExisting documents re-ingested after a change
documents_skippedUnchanged documents not re-processed
documents_failedDocuments that failed extraction or ingestion
chunks_createdEvidence chunks created during the sync
chunks_deletedEvidence chunks removed during the sync
statusSync outcome: completed, partial_ok, or failed
trigger_typeWhat started the sync: scheduled or manual
error_messageMost recent error summary, if any
error_detailsStructured per-document error details

Resource limits

ResourceLimitBehavior when exceeded
Document content size50 MB rawDocument fails and is counted in documents_failed; the sync continues

We use cookies for analytics to improve our website. More information in our Privacy Policy.