You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| pgvector | vector | vector similarity search (requires [pgvector](https://github.com/pgvector/pgvector) extension and the `pgvector` Python package) |
320
321
321
322
### Using the Singer catalog to narrow down the Postgres data types
322
323
@@ -350,6 +351,36 @@ plugins:
350
351
x-sql-datatype: smallint
351
352
```
352
353
354
+
For vector embeddings:
355
+
356
+
```yaml
357
+
# meltano.yml
358
+
plugins:
359
+
extractors:
360
+
- name: tap-my-tap
361
+
schema:
362
+
some_stream_id:
363
+
embedding:
364
+
type: array
365
+
items:
366
+
type: number
367
+
x-sql-datatype: pgvector
368
+
```
369
+
370
+
**Important:** To use `pgvector` data types:
371
+
1. The [pgvector extension](https://github.com/pgvector/pgvector) **MUST** be installed and enabled in your PostgreSQL database:
372
+
```sql
373
+
CREATE EXTENSION IF NOT EXISTS vector;
374
+
```
375
+
2. The `pgvector` Python package **MUST** be installed in your environment:
376
+
```bash
377
+
pip install pgvector
378
+
# or with the target
379
+
pip install meltanolabs-target-postgres pgvector
380
+
```
381
+
382
+
If the `pgvector` Python package is not installed, the target will fall back to using `ARRAY(INTEGER)` with a warning.
383
+
353
384
## Content Encoding Support
354
385
355
386
Json Schema supports the [`contentEncoding` keyword](https://datatracker.ietf.org/doc/html/rfc4648#section-8), which can be used to specify the encoding of input string types.
0 commit comments