How to insert pre-chunked data into a vector store

MehrCurry
(@mehrcurry)
Posts: 2
New Member
Topic starter
 

I have an external web service that processes PDF files, splits them into chunks using LangChain, and returns those chunks as JSON.

{
  "document": "This is a simple PDF file. Fun fun fun.\nLorem ipsum dolor sit amet,  consectetuer adipiscing elit. Phasellus facilisis odio sed mi. Curabitur suscipit. Nullam vel nisi. Etiam semper ipsum ut lectus. Proin aliquam, erat eget pharetra  commodo,  eros  mi  condimentum quam,  sed  commodo  justo  quam  ut  velit. Integer  a  erat. Cras  laoreet  ligula  cursus  enim. Aenean  scelerisque  velit  et  tellus. Vestibulum dictum aliquet sem.  Nulla facilisi.  Vestibulum accumsan  ante  vitae  elit.  Nulla erat  dolor,  blandit  in,  rutrum  quis,  semper  pulvinar,  enim.  Nullam varius  congue  risus. Vivamus  sollicitudin,  metus  ut  interdum  eleifend,  nisi  tellus  pellentesque  elit,  tristique accumsan  eros  quam et  risus.  Suspendisse  libero  odio,  mattis  sit  amet,  aliquet  eget, hendrerit vel,  nulla. Sed vitae augue. Aliquam erat volutpat. Aliquam feugiat vulputate nisl. Suspendisse quis nulla pretium ante pretium mollis. Proin velit ligula, sagittis at, egestas a, pulvinar quis, nisl.",
  "metadata": {
    "source": "sample.pdf",
    "chunk_index": 0,
    "chunk_type": null,
    "chunk_text": "Lorem ipsum dolor sit amet,  consectetuer adipiscing elit. Phasellus facilisis odio sed mi. Curabitur suscipit. Nullam vel nisi. Etiam semper ipsum ut lectus. Proin aliquam, erat eget pharetra  commodo,  eros  mi  condimentum quam,  sed  commodo  justo  quam  ut  velit. Integer  a  erat. Cras  laoreet  ligula  cursus  enim. Aenean  scelerisque  velit  et  tellus. Vestibulum dictum aliquet sem.  Nulla facilisi.  Vestibulum accumsan  ante  vitae  elit.  Nulla erat  dolor,  blandit  in,  rutrum  quis,  semper  pulvinar,  enim.  Nullam varius  congue  risus. Vivamus  sollicitudin,  metus  ut  interdum  eleifend,  nisi  tellus  pellentesque  elit,  tristique accumsan  eros  quam et  risus.  Suspendisse  libero  odio,  mattis  sit  amet,  aliquet  eget, hendrerit vel,  nulla. Sed vitae augue. Aliquam erat volutpat. Aliquam feugiat vulputate nisl. Suspendisse quis nulla pretium ante pretium mollis. Proin velit ligula, sagittis at, egestas a, pulvinar quis, nisl.",
    "content_type": "application/pdf",
    "size": 18810
  },
  "chunks": [
    {
      "text": "This is a simple PDF file. Fun fun fun.\nPellentesque  sit  amet  lectus.  Praesent  pulvinar,  nunc  quis  iaculis  sagittis,  justo  quam lobortis tortor,  sed  vestibulum dui metus venenatis est.  Nunc  cursus ligula. Nulla facilisi. Phasellus ullamcorper consectetuer ante. Duis tincidunt, urna id condimentum luctus, nibh ante  vulputate  sapien,  id  sagittis  massa orci  ut  enim.  Pellentesque  vestibulum convallis sem. Nulla consequat quam ut nisl.  Nullam est.  Curabitur tincidunt dapibus lorem.  Proin velit  turpis,  scelerisque  sit  amet,  iaculis  nec,  rhoncus  ac,  ipsum.  Phasellus  lorem arcu, feugiat  eu,  gravida  eu,  consequat  molestie,  ipsum.  Nullam  vel  est  ut  ipsum  volutpat feugiat. Aenean pellentesque.",
      "metadata": {
        "source": "sample.pdf",
        "chunk_index": 0,
        "chunk_type": null,
        "chunk_text": "P

I am using an HTTP Request node to receive the JSON string. I need to find a way to inject these chunks into a vector store within callin.io. The vector store nodes in callin.io require a data loader and a splitter, which are already configured.

Is it possible to utilize my pre-chunked data?
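
For illustration only, here is a minimal Python sketch of the reshaping step: it flattens a response like the one above into one record per chunk, so that each chunk can be handed to a vector store as its own item. The function name `flatten_chunks` and the output keys are hypothetical, and the response shape (a top-level `chunks` list whose entries carry `text` and `metadata`) is assumed from the truncated sample. Nothing in this thread confirms that this is how the data ends up in the vector store.

```python
import json
from typing import Any


def flatten_chunks(payload: Any) -> list:
    """Return one {"text", "metadata"} record per chunk in the response.

    `payload` may be the raw JSON string or an already-parsed dict.
    The "chunks" / "text" / "metadata" keys are assumed from the sample.
    """
    data = json.loads(payload) if isinstance(payload, str) else payload
    records = []
    for position, chunk in enumerate(data.get("chunks", [])):
        text = chunk.get("text") or ""
        if not text.strip():
            continue  # skip chunks that contain no usable text
        # Copy the metadata so the original response is left untouched.
        metadata = dict(chunk.get("metadata") or {})
        # Fall back to the list position when the service omits the index.
        metadata.setdefault("chunk_index", position)
        records.append({"text": text, "metadata": metadata})
    return records
```

Each returned record keeps the chunk text unchanged. Whether the data loader and splitter can be configured to pass such records through without splitting them again is a separate question that this sketch does not answer.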

Information on your callin.io setup

  • callin.io version: 1.79.2
  • Running callin.io via (Docker, npm, callin.io cloud, desktop app): docker
  • Operating system: macOS
 
Posted : 21/02/2025 7:04 pm
n8n
(@n8n)
Posts: 97
Trusted Member
 

It appears your topic is missing some crucial details. Could you please provide the following information, if relevant?

  • callin.io version:
  • Database (default: SQLite):
  • callin.io EXECUTIONS_PROCESS setting (default: own, main):
  • Running callin.io via (Docker, npm, callin.io cloud, desktop app):
  • Operating system:

Please provide the requested details.

 
Posted : 21/02/2025 7:04 pm
MehrCurry
(@mehrcurry)
Posts: 2
New Member
Topic starter
 

After an additional hour of investigation and experimentation, I've discovered the solution.

 
Posted : 21/02/2025 7:50 pm
system
(@system)
Posts: 332
Reputable Member
 

This discussion was automatically closed 7 days following the last response. New replies are no longer permitted.

 
Posted : 04/03/2025 9:04 am