knowledge import

Import one or more datasets from an archive (zip) (default: all datasets)

Synopsis

Import one or more datasets from an archive (zip) (default: all datasets).

IMPORTANT: Embedding functions

When someone first ingests some data into a dataset, the embedding provider configured at that time will be attached to the dataset. Upon subsequent ingestion actions, the same embedding provider must be used to ensure that the embeddings are consistent. Most of the times, the only field that has to be the same is the model, as that defines the dimensionality usually. Note: This is only relevant if you plan to add more documents to the dataset after importing it.

knowledge import <path> [<dataset-id>...] [flags]

Options

      --auto-migrate string               Auto migrate database ($KNOW_DB_AUTO_MIGRATE) (default "true")
  -c, --config-file string                Path to the configuration file ($KNOW_CONFIG_FILE)
      --embedding-model-provider string   Embedding model provider ($KNOW_EMBEDDING_MODEL_PROVIDER) (default "openai")
  -h, --help                              help for import
      --index-dsn string                  Index Database Connection string (relational DB) (default "sqlite://$XDG_DATA_HOME/gptscript/knowledge/knowledge.db") ($KNOW_INDEX_DSN)
      --server string                     URL of the Knowledge API Server ($KNOW_SERVER_URL)
      --vector-dsn string                 DSN to the vector database (default "chromem:$XDG_DATA_HOME/gptscript/knowledge/vector.db") ($KNOW_VECTOR_DSN)