From acbf4a65b696aa86b5c6153e026a40abcf695ecf Mon Sep 17 00:00:00 2001
From: Han Xiao <han.xiao@jina.ai>
Date: Thu, 22 Jun 2023 23:21:58 +0800
Subject: [PATCH 1/5] feat: add docarray version to push

Signed-off-by: Han Xiao <han.xiao@jina.ai>
---
 .github/ISSUE_TEMPLATE/bug-v1-deprecated.yml |   2 +-
 README.md                                    | 132 +++++++++----------
 docarray/array/doc_list/io.py                |   2 +-
 docarray/documents/legacy/legacy_document.py |   4 +-
 docs/assets/docarray-colorful.svg            |  16 +++
 docs/how_to/add_doc_index.md                 |   2 +-
 docs/migration_guide.md                      |   6 +-
 docs/user_guide/representing/array.md        |   2 +-
 docs/user_guide/sending/api/jina.md          |   2 +-
 docs/user_guide/sending/first_step.md        |   2 +-
 docs/user_guide/storing/first_step.md        |   2 +-
 mkdocs.yml                                   |   2 +-
 12 files changed, 95 insertions(+), 79 deletions(-)
 create mode 100644 docs/assets/docarray-colorful.svg
diff --git a/.github/ISSUE_TEMPLATE/bug-v1-deprecated.yml b/.github/ISSUE_TEMPLATE/bug-v1-deprecated.yml
index 885c37877b9..ecedf100fc6 100644
--- a/.github/ISSUE_TEMPLATE/bug-v1-deprecated.yml
+++ b/.github/ISSUE_TEMPLATE/bug-v1-deprecated.yml
@@ -1,4 +1,4 @@
-name: 🐛 DocArray V1 Bug (0.1.0 - 0.20.1) (Deprecated Version)
+name: 🐛 DocArray <=0.21 Bug (0.1.0 - 0.20.1) (Deprecated Version)
 description: Report a bug or unexpected behavior in DocArray version prior to v2 (0.21.1)
 labels: [bug V1, unconfirmed]
 
diff --git a/README.md b/README.md
index 0d60bbfdda2..b9c286c5ab4 100644
--- a/README.md
+++ b/README.md
@@ -12,25 +12,31 @@
 <a href="https://discord.gg/WaMp6PVPgR"><img src="https://dcbadge.vercel.app/api/server/WaMp6PVPgR?theme=default-inverted&style=flat-square"></a>
 </p>
 
-> ⬆️ **DocArray v2**: This readme is for the second version of DocArray (starting at 0.30). If you want to use the older
-> version (prior to 0.30) check out the [docarray-v1-fixes](https://github.com/docarray/docarray/tree/docarray-v1-fixes) branch
+> > **Note**
+> The README you're currently viewing is for DocArray>0.30, which introduces some significant changes from DocArray 0.21. If you wish to continue using the older DocArray <=0.21, ensure you install it via `pip install docarray==0.21`. Refer to its [codebase](https://github.com/docarray/docarray/tree/v0.21.0), [documentation](https://docarray.jina.ai), and [its hot-fixes branch](https://github.com/docarray/docarray/tree/docarray-v1-fixes) for more information.
 
-DocArray is a library for **representing, sending and storing multi-modal data**, perfect for **Machine Learning applications**.
 
-With DocArray you can:
+DocArray is a Python library expertly crafted for the [representation](#represent), [transmission](#send), [storage](#store), and [retrieval](#retrieve) of multimodal data. Tailored for the development of multimodal AI applications, its design guarantees seamless integration with the extensive Python and machine learning ecosystems. As of January 2022, DocArray is openly distributed under the [Apache License 2.0](https://github.com/docarray/docarray/blob/main/LICENSE) and currently enjoys the status of a sandbox project within the [LF AI & Data Foundation](https://lfaidata.foundation/).
 
-1. [**Represent data**](#represent)
-2. [**Send data**](#send)
-3. [**Store data**](#store)
 
-DocArray handles your data while integrating seamlessly with the rest of your **Python and ML ecosystem**:
 
-- :fire: Native compatibility for **[NumPy](https://github.com/numpy/numpy)**, **[PyTorch](https://github.com/pytorch/pytorch)** and **[TensorFlow](https://github.com/tensorflow/tensorflow)**, including for **model training use cases**
-- :zap: Built on **[Pydantic](https://github.com/pydantic/pydantic)** and out-of-the-box compatible with **[FastAPI](https://github.com/tiangolo/fastapi/)** and **[Jina](https://github.com/jina-ai/jina/)**
-- :package: Support for vector databases like **[Weaviate](https://weaviate.io/), [Qdrant](https://qdrant.tech/), [ElasticSearch](https://www.elastic.co/de/elasticsearch/)** and **[HNSWLib](https://github.com/nmslib/hnswlib)**
-- :chains: Send data as JSON over **HTTP** or as **[Protobuf](https://protobuf.dev/)** over **[gRPC](https://grpc.io/)**
+- :fire: Offers native support for **[NumPy](https://github.com/numpy/numpy)**, **[PyTorch](https://github.com/pytorch/pytorch)**, and **[TensorFlow](https://github.com/tensorflow/tensorflow)**, catering specifically to **model training scenarios**.
+- :zap: Based on **[Pydantic](https://github.com/pydantic/pydantic)**, and instantly compatible with web and microservice frameworks like **[FastAPI](https://github.com/tiangolo/fastapi/)** and **[Jina](https://github.com/jina-ai/jina/)**.
+- :package: Provides support for vector databases such as **[Weaviate](https://weaviate.io/), [Qdrant](https://qdrant.tech/), [ElasticSearch](https://www.elastic.co/de/elasticsearch/)**, and **[HNSWLib](https://github.com/nmslib/hnswlib)**.
+- :chains: Allows data transmission as JSON over **HTTP** or as **[Protobuf](https://protobuf.dev/)** over **[gRPC](https://grpc.io/)**.
 
-> :bulb: **Where are you coming from?** Based on your use case and background, there are different ways to understand DocArray:
+## Installation
+
+To install DocArray from the CLI, run the following command:
+
+```shell
+pip install -U docarray
+```
+
+> > **Note**
+> To use DocArray <=0.21, make sure you install via `pip install docarray==0.21` and check out its [codebase](https://github.com/docarray/docarray/tree/v0.21.0) and [docs](https://docarray.jina.ai) and [its hot-fixes branch](https://github.com/docarray/docarray/tree/docarray-v1-fixes).
+
+> :bulb: **New to DocArray?** Depending on your use case and background, there are multiple ways to learn about DocArray:
 > 
 > - [Coming from pure PyTorch or TensorFlow](#coming-from-pytorch)
 > - [Coming from Pydantic](#coming-from-pydantic)
@@ -38,23 +44,23 @@ DocArray handles your data while integrating seamlessly with the rest of your **
 > - [Coming from a vector database](#coming-from-vector-database)
 > - [Coming from Langchain](#coming-from-langchain)
 
-DocArray has been distributed under the open-source [Apache License 2.0](https://github.com/docarray/docarray/blob/main/LICENSE) since January 2022. It is currently a sandbox project under [LF AI & Data Foundation](https://lfaidata.foundation/).
 
 ## Represent
 
-DocArray allows you to **represent your data**, in an ML-native way.
+DocArray empowers you to **represent your data** in a manner that is inherently attuned to machine learning.
 
-This is useful for different use cases:
+This is particularly beneficial for various scenarios:
 
-- :running: You are **training a model**: There are tensors of different shapes and sizes flying around, representing different _things_, and you want to keep a straight head about them.
-- :cloud: You are **serving a model**: For example through FastAPI, and you want to specify your API endpoints.
-- :card_index_dividers: You are **parsing data**: For later use in your ML or data science applications.
+- :running: You are **training a model**: You're dealing with tensors of varying shapes and sizes, each signifying different elements. You desire a method to logically organize them.
+- :cloud: You are **serving a model**: Let's say through FastAPI, and you wish to define your API endpoints precisely.
+- :card_index_dividers: You are **parsing data**: Perhaps for future deployment in your machine learning or data science projects.
 
-> :bulb: **Coming from Pydantic?** You should be happy to hear
-> that DocArray is built on top of, and is fully compatible with, Pydantic!
-> Also, we have a [dedicated section](#coming-from-pydantic) just for you!
+> :bulb: **Familiar with Pydantic?** You'll be pleased to learn
+> that DocArray is not only constructed atop Pydantic but also maintains complete compatibility with it!
+> Furthermore, we have a [specific section](#coming-from-pydantic) dedicated to your needs!
+
+In essence, DocArray facilitates data representation in a way that mirrors Python dataclasses, with machine learning being an integral component:
 
-Put simply, DocArray lets you represent your data in a dataclass-like way, with ML as a first class citizen:
 
 ```python
 from docarray import BaseDoc
@@ -256,21 +262,22 @@ assert isinstance(dl_2, DocList)
 
 ## Send
 
-DocArray allows you to **send your data** in an ML-native way.
+DocArray facilitates the **transmission of your data** in a manner inherently compatible with machine learning.
+
+This includes native support for **Protobuf and gRPC**, along with **HTTP** and serialization to JSON, JSONSchema, Base64, and Bytes.
 
-This means there is native support for **Protobuf and gRPC**, on top of **HTTP** and serialization to JSON, JSONSchema, Base64, and Bytes.
+This feature proves beneficial for several scenarios:
 
-This is useful for different use cases:
+- :cloud: You are **serving a model**, perhaps through frameworks like **[Jina](https://github.com/jina-ai/jina/)** or **[FastAPI](https://github.com/tiangolo/fastapi/)**
+- :spider_web: You are **distributing your model** across multiple machines and need an efficient means of transmitting your data between nodes
+- :gear: You are architecting a **microservice** environment and require a method for data transmission between microservices
 
-- :cloud: You are **serving a model**, for example through **[Jina](https://github.com/jina-ai/jina/)** or **[FastAPI](https://github.com/tiangolo/fastapi/)**
-- :spider_web: You are **distributing your model** across machines and need to send your data between nodes
-- :gear: You are building a **microservice** architecture and need to send your data between microservices
+> :bulb: **Are you familiar with FastAPI?** You'll be delighted to learn
+> that DocArray maintains full compatibility with FastAPI!
+> Plus, we have a [dedicated section](#coming-from-fastapi) specifically for you!
 
-> :bulb: **Coming from FastAPI?** You should be happy to hear
-> that DocArray is fully compatible with FastAPI!
-> Also, we have a [dedicated section](#coming-from-fastapi) just for you!
+When it comes to data transmission, serialization is a crucial step. Let's delve into how DocArray streamlines this process:
 
-Whenever you want to send your data, you need to serialize it, so let's take a look at how that works with DocArray:
 
 ```python
 from docarray import BaseDoc
@@ -305,18 +312,14 @@ Of course, serialization is not all you need. So check out how DocArray integrat
 
 ## Store
 
-Once you've modelled your data, and maybe sent it around, usually you want to **store it** somewhere.
-DocArray has you covered!
+After modeling and possibly distributing your data, you'll typically want to **store it** somewhere. That's where DocArray steps in!
 
-**Document Stores** let you, well, store your Documents, locally or remotely, all with the same user interface:
+**Document Stores** provide a seamless way to, as the name suggests, store your Documents. Be it locally or remotely, you can do it all through the same user interface:
 
-- :cd: **On disk** as a file in your local file system
+- :cd: **On disk**, as a file in your local filesystem
 - :bucket: On **[AWS S3](https://aws.amazon.com/de/s3/)**
 - :cloud: On **[Jina AI Cloud](https://cloud.jina.ai/)**
 
-<details markdown="1">
-  <summary>See Document Store usage</summary>
-
 The Document Store interface lets you push and pull Documents to and from multiple data sources, all with the same user interface.
 
 For example, let's see how that works with on-disk storage:
@@ -334,7 +337,8 @@ docs.push('file://simple_docs')
 
 docs_pull = DocList[SimpleDoc].pull('file://simple_docs')
 ```
-</details>
+
+## Retrieve
 
 **Document Indexes** let you index your Documents in a **vector database** for efficient similarity-based retrieval.
 
@@ -346,9 +350,6 @@ This is useful for:
 
 Currently, Document Indexes support **[Weaviate](https://weaviate.io/)**, **[Qdrant](https://qdrant.tech/)**, **[ElasticSearch](https://www.elastic.co/)**, and **[HNSWLib](https://github.com/nmslib/hnswlib)**, with more to come!
 
-<details markdown="1">
-  <summary>See Document Index usage</summary>
-
 The Document Index interface lets you index and retrieve Documents from multiple vector databases, all with the same user interface.
 
 It supports ANN vector search, text search, filtering, and hybrid search.
@@ -391,18 +392,21 @@ query = dl[0]
 results, scores = index.find(query, limit=10, search_field='embedding')
 ```
 
-</details>
+
+---
+
+## Learn DocArray
 
 Depending on your background and use case, there are different ways for you to understand DocArray.
 
-## Coming from old DocArray
+### Coming from DocArray <=0.21
 
 <details markdown="1">
   <summary>Click to expand</summary>
 
 If you are using DocArray version 0.30.0 or lower, you will be familiar with its [dataclass API](https://docarray.jina.ai/fundamentals/dataclass/).
 
-_DocArray v2 is that idea, taken seriously._ Every document is created through a dataclass-like interface,
+_DocArray >=0.30 is that idea, taken seriously._ Every document is created through a dataclass-like interface,
 courtesy of [Pydantic](https://pydantic-docs.helpmanual.io/usage/models/).
 
 This gives the following advantages:
@@ -420,7 +424,7 @@ For now, Document Indexes support **[Weaviate](https://weaviate.io/)**, **[Qdran
 
 </details>
 
-## Coming from Pydantic
+### Coming from Pydantic
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -497,7 +501,7 @@ except Exception as e:
 
 </details>
 
-## Coming from PyTorch
+### Coming from PyTorch
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -511,7 +515,7 @@ It offers you several advantages:
 - **Go directly to deployment**, by re-using your data model as a [FastAPI](https://fastapi.tiangolo.com/) or [Jina](https://github.com/jina-ai/jina) API schema
 - Connect model components between **microservices**, using Protobuf and gRPC
 
-DocArray can be used directly inside ML models to handle and represent multi-modal data.
+DocArray can be used directly inside ML models to handle and represent multimodaldata.
 This allows you to reason about your data using DocArray's abstractions deep inside of `nn.Module`,
 and provides a FastAPI-compatible schema that eases the transition between model training and model serving.
 
@@ -609,7 +613,7 @@ schema definition (see [below](#coming-from-fastapi)). Everything is handled in
 </details>
 
 
-## Coming from TensorFlow
+### Coming from TensorFlow
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -657,7 +661,7 @@ class MyPodcastModel(tf.keras.Model):
 
 </details>
 
-## Coming from FastAPI
+### Coming from FastAPI
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -680,6 +684,7 @@ from docarray import BaseDoc
 from docarray.documents import ImageDoc
 from docarray.typing import NdArray
 
+
 class InputDoc(BaseDoc):
     img: ImageDoc
     text: str
@@ -692,12 +697,15 @@ class OutputDoc(BaseDoc):
 
 app = FastAPI()
 
+
 def model_img(img: ImageTensor) -> NdArray:
     return np.zeros((100, 1))
 
+
 def model_text(text: str) -> NdArray:
     return np.zeros((100, 1))
 
+
 @app.post("/embed/", response_model=OutputDoc, response_class=DocArrayResponse)
 async def create_item(doc: InputDoc) -> OutputDoc:
     doc = OutputDoc(
@@ -705,16 +713,16 @@ async def create_item(doc: InputDoc) -> OutputDoc:
     )
     return doc
 
+
 async with AsyncClient(app=app, base_url="http://test") as ac:
     response = await ac.post("/embed/", data=input_doc.json())
-
 ```
 
 Just like a vanilla Pydantic model!
 
 </details>
 
-## Coming from a vector database
+### Coming from a vector database
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -770,14 +778,14 @@ Currently, DocArray supports the following vector databases:
 
 An integration of [OpenSearch](https://opensearch.org/) is currently in progress.
 
-Legacy versions of DocArray also support [Redis](https://redis.io/) and [Milvus](https://milvus.io/), but these are not yet supported in the current version.
+DocArray <=0.21 also support [Redis](https://redis.io/) and [Milvus](https://milvus.io/), but these are not yet supported in the current version.
 
 Of course this is only one of the things that DocArray can do, so we encourage you to check out the rest of this readme!
 
 </details>
 
 
-## Coming from Langchain
+### Coming from Langchain
 
 <details markdown="1">
   <summary>Click to expand</summary>
@@ -835,7 +843,6 @@ db = InMemoryExactNNIndex[MovieDoc](docs)
 
 3. Finally, initialize a retriever and integrate it into your chain!
 ```python
-
 from langchain.chat_models import ChatOpenAI
 from langchain.chains import ConversationalRetrievalChain
 from langchain.retrievers import DocArrayRetriever
@@ -859,20 +866,13 @@ Both are user-friendly and are best suited to small to medium-sized datasets.
 
 </details>
 
-## Installation
-
-To install DocArray from the CLI, run the following command:
-
-```shell
-pip install -U docarray
-```
 
 ## See also
 
 - [Documentation](https://docs.docarray.org)
+- [DocArray<=0.21 documentation](https://docarray.jina.ai/)
 - [Join our Discord server](https://discord.gg/WaMp6PVPgR)
 - [Donation to Linux Foundation AI&Data blog post](https://jina.ai/news/donate-docarray-lf-for-inclusive-standard-multimodal-data-model/)
-- ["Legacy" DocArray github page](https://github.com/docarray/docarray/tree/docarray-v1-fixes)
-- ["Legacy" DocArray documentation](https://docarray.jina.ai/)
+
 
 > DocArray is a trademark of LF AI Projects, LLC
diff --git a/docarray/array/doc_list/io.py b/docarray/array/doc_list/io.py
index c2b531c2550..9667c673c09 100644
--- a/docarray/array/doc_list/io.py
+++ b/docarray/array/doc_list/io.py
@@ -555,7 +555,7 @@ def _stream_header(self) -> bytes:
         # Binary format for streaming case
 
         # V2 DocList streaming serialization format
-        # | 1 byte | 8 bytes | 4 bytes | variable(docarray v2) | 4 bytes | variable(docarray v2) ...
+        # | 1 byte | 8 bytes | 4 bytes | variable(DocArray >=0.30) | 4 bytes | variable(DocArray >=0.30) ...
 
         # 1 byte (uint8)
         version_byte = b'\x02'
diff --git a/docarray/documents/legacy/legacy_document.py b/docarray/documents/legacy/legacy_document.py
index eea42f1d93e..74a105fbcfe 100644
--- a/docarray/documents/legacy/legacy_document.py
+++ b/docarray/documents/legacy/legacy_document.py
@@ -8,10 +8,10 @@
 
 class LegacyDocument(BaseDoc):
     """
-    This Document is the LegacyDocument. It follows the same schema as in DocArray v1.
+    This Document is the LegacyDocument. It follows the same schema as in DocArray <=0.21.
     It can be useful to start migrating a codebase from v1 to v2.
 
-    Nevertheless, the API is not totally compatible with DocArray v1 `Document`.
+    Nevertheless, the API is not totally compatible with DocArray <=0.21 `Document`.
     Indeed, none of the method associated with `Document` are present. Only the schema
     of the data is similar.
 
diff --git a/docs/assets/docarray-colorful.svg b/docs/assets/docarray-colorful.svg
new file mode 100644
index 00000000000..ed803d09d56
--- /dev/null
+++ b/docs/assets/docarray-colorful.svg
@@ -0,0 +1,16 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<svg width="320px" height="320px" viewBox="0 0 320 320" version="1.1" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
+    <title>docarray-colorful 2</title>
+    <g id="docarray-colorful-2" stroke="none" stroke-width="1" fill="none" fill-rule="evenodd">
+        <g id="编组-58" transform="translate(31.000000, 31.000000)">
+            <g id="编组备份" fill="#009191" fill-rule="nonzero">
+                <g id="编组-57" transform="translate(129.000000, 129.000000) scale(-1, 1) translate(-129.000000, -129.000000) ">
+                    <path d="M165,10 C165,15.5228475 160.522847,20 155,20 L129,20 L129,20 C69.9989231,20 21.9467921,66.8780096 20.0576823,125.419953 L20.0146029,127.197486 L20,129 C20,188.597047 67.8298231,237.022841 127.197486,237.985397 L155.000001,237.996137 C160.523473,237.998271 165,242.476528 165,248 C165,253.522847 160.522847,258 155,258 L129,258 L129,258 C58.4677146,258 1.15645429,201.394063 0.0172823144,131.133251 L0,129 C0,58.4677146 56.6059375,1.15645429 126.866749,0.0172823144 L155.000001,0.00453208478 C160.520346,0.0020302176 164.997497,4.47512547 164.999999,9.99546997 C165,9.99697998 165,9.99848999 165,10 Z M240,0 C249.830669,0 257.820682,7.88078333 257.997024,17.6693348 L258,18 L258,248 C258,253.42924 253.673329,257.847932 248.279905,257.996158 L248,258 L204,258 C198.477153,258 194,253.522847 194,248 C194,242.477153 198.477153,238 204,238 L238,238 L238,238 L238,20 L204,20 C198.477153,20 194,15.5228475 194,10 C194,4.4771525 198.477153,1.01453063e-15 204,0 L240,0 L240,0 Z" id="形状"></path>
+                </g>
+            </g>
+            <path d="M116.688312,65 L116.864584,65.0050927 C118.439392,65.0963391 119.688312,66.4023191 119.688312,68 L119.689,117.855 L119.671078,117.851234 C119.27284,117.766119 118.881185,117.68394 118.496113,117.604696 L117.360647,117.375768 C117.174694,117.339081 116.990387,117.303128 116.807726,117.267908 L115.731507,117.065395 C115.555429,117.03311 115.380996,117.001559 115.208209,116.970742 L114.191238,116.794644 L114.191238,116.794644 L113.213766,116.636155 C113.054145,116.611208 112.896171,116.586994 112.739842,116.563515 L111.821617,116.431441 L111.821617,116.431441 L110.94289,116.316977 C109.224936,116.105659 107.743973,116 106.5,116 L105.900502,116.003729 C93.5267239,116.157701 81.9475147,121.07369 73.3035829,129.525108 L72.825,129.999 L57.688262,130 C56.0314574,130 54.6883117,128.656854 54.6883117,127 L54.6883117,68 C54.6883117,66.3431458 56.0314574,65 57.6883117,65 L116.688312,65 Z" id="路径备份" fill="#FF455A" fill-rule="nonzero"></path>
+            <path d="M106.5,126 C112.504513,126 118.179413,127.411237 123.211113,129.920123 L120.95949,133.742705 C116.753408,140.879792 119.129458,150.075249 126.266546,154.281331 C128.574151,155.641266 131.203812,156.35849 133.882331,156.35849 L143.320821,156.358558 C143.766575,158.670456 144,161.057968 144,163.5 C144,184.210678 127.210678,201 106.5,201 C85.7893219,201 69,184.210678 69,163.5 C69,142.789322 85.7893219,126 106.5,126 Z" id="路径备份-2" fill="#009191"></path>
+            <path d="M168.245089,88.0242111 L196.633659,138.92297 C197.048941,139.667543 197.266939,140.505936 197.266939,141.35849 C197.266939,144.119914 195.028363,146.35849 192.266939,146.35849 L133.882331,146.35849 C132.989491,146.35849 132.112938,146.119415 131.343736,145.666104 C128.964707,144.264076 128.17269,141.198924 129.574718,138.819895 L159.570756,87.9211357 C160.021445,87.1563834 160.667604,86.525401 161.44285,86.0930109 C163.854522,84.7479105 166.899989,85.6125382 168.245089,88.0242111 Z" id="路径-73备份" fill="#FFC92A" fill-rule="nonzero"></path>
+        </g>
+    </g>
+</svg>
\ No newline at end of file
diff --git a/docs/how_to/add_doc_index.md b/docs/how_to/add_doc_index.md
index 28eb5b5e128..facadaefb06 100644
--- a/docs/how_to/add_doc_index.md
+++ b/docs/how_to/add_doc_index.md
@@ -384,7 +384,7 @@ When indexing documents, your implementation should behave in the following way:
 - Every field in the Document is mapped to a column in the database
 - This includes the `id` field, which is mapped to the primary key of the database (if your backend has such a concept)
 - The configuration of that column can be found in `self._column_infos[field_name].config`
-- In DocArray v1, we used to store a serialized representation of every document. This is not needed anymore, as every row in your database table should fully represent a single indexed document.
+- In DocArray <=0.21, we used to store a serialized representation of every document. This is not needed anymore, as every row in your database table should fully represent a single indexed document.
 
 To handle nested documents, the public `index()` method already flattens every incoming document for you.
 This means that `_index()` already receives a flattened representation of the data, and you don't need to worry about that.
diff --git a/docs/migration_guide.md b/docs/migration_guide.md
index ab347f5eac2..2609deecf38 100644
--- a/docs/migration_guide.md
+++ b/docs/migration_guide.md
@@ -2,7 +2,7 @@
 
 If you are using DocArray v<0.30.0, you will be familiar with its [dataclass API](https://docarray.jina.ai/fundamentals/dataclass/).
 
-_DocArray v2 is that idea, taken seriously._ Every document is created through a dataclass-like interface,
+_DocArray >=0.30 is that idea, taken seriously._ Every document is created through a dataclass-like interface,
 courtesy of [Pydantic](https://pydantic-docs.helpmanual.io/usage/models/).
 
 This gives the following advantages:
@@ -33,7 +33,7 @@ and additional `chunks` and `matches`.
 - In v2 we have the [`LegacyDocument`][docarray.documents.legacy.LegacyDocument] class, 
   which extends `BaseDoc` while following the same schema as v1's `Document`.
   The `LegacyDocument` can be useful to start migrating your codebase from v1 to v2. 
-  Nevertheless, the API is not fully compatible with DocArray v1 `Document`.
+  Nevertheless, the API is not fully compatible with DocArray <=0.21 `Document`.
   Indeed, none of the methods associated with `Document` are present. 
   Only the schema of the data is similar.
 
@@ -100,7 +100,7 @@ book_titles = docs.title  # returns a list[str]
 ## Changes to Document Store
 
 In v2 the `Document Store` has been renamed to [`DocIndex`](user_guide/storing/docindex.md) and can be used for fast retrieval using vector similarity. 
-DocArray v2 `DocIndex` supports:
+DocArray >=0.30 `DocIndex` supports:
 
 - [Weaviate](https://weaviate.io/)
 - [Qdrant](https://qdrant.tech/)
diff --git a/docs/user_guide/representing/array.md b/docs/user_guide/representing/array.md
index 9a07d649d5a..c659461d1bd 100644
--- a/docs/user_guide/representing/array.md
+++ b/docs/user_guide/representing/array.md
@@ -1,6 +1,6 @@
 # Array of documents
 
-DocArray allows users to represent and manipulate multi-modal data to build AI applications such as neural search and generative AI. 
+DocArray allows users to represent and manipulate multimodaldata to build AI applications such as neural search and generative AI. 
 
 As you have seen in the [previous section](array.md), the fundamental building block of DocArray is the [`BaseDoc`][docarray.base_doc.doc.BaseDoc] class which represents a *single* document, a *single* datapoint.
 
diff --git a/docs/user_guide/sending/api/jina.md b/docs/user_guide/sending/api/jina.md
index eb0f13e1cc3..360c61ddf62 100644
--- a/docs/user_guide/sending/api/jina.md
+++ b/docs/user_guide/sending/api/jina.md
@@ -4,7 +4,7 @@ In this example we'll build an audio-to-text app using [Jina](https://docs.jina.
 
 We will use: 
 
-* DocArray V2: To load and preprocess multimodal data such as image, text and audio.
+* DocArray >=0.30: To load and preprocess multimodal data such as image, text and audio.
 * Jina: To serve the model quickly and create a client.
 
 ## Install packages
diff --git a/docs/user_guide/sending/first_step.md b/docs/user_guide/sending/first_step.md
index d13568c8e2f..0b80ad9d532 100644
--- a/docs/user_guide/sending/first_step.md
+++ b/docs/user_guide/sending/first_step.md
@@ -1,7 +1,7 @@
 # Introduction
 
 In the representation section we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec]
-to represent multi-modal data. In this section we will see **how to send such data over the wire**.
+to represent multimodaldata. In this section we will see **how to send such data over the wire**.
 
 This section is divided into two parts:
 
diff --git a/docs/user_guide/storing/first_step.md b/docs/user_guide/storing/first_step.md
index 53068e07ddd..96fe45d6800 100644
--- a/docs/user_guide/storing/first_step.md
+++ b/docs/user_guide/storing/first_step.md
@@ -1,6 +1,6 @@
 # Introduction
 
-In the previous sections we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec] to represent multi-modal data and send it over the wire.
+In the previous sections we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec] to represent multimodaldata and send it over the wire.
 In this section we will see how to store and persist this data.
 
 DocArray offers two ways of storing your data, each of which have their own documentation sections:
diff --git a/mkdocs.yml b/mkdocs.yml
index bcf37959314..73d789e60b4 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -5,7 +5,7 @@ repo_name: docarray/docarray
 repo_url: https://github.com/docarray/docarray
 edit_uri: ''
 theme:
-  logo: assets/logo-light.svg
+  logo: assets/docarray-colorful.svg
 
   favicon: assets/favicon.png
   name: material

From e74a6515a1fa3c471503bf0ea14095e8c9a14b2b Mon Sep 17 00:00:00 2001
From: Han Xiao <han.xiao@jina.ai>
Date: Thu, 22 Jun 2023 23:26:12 +0800
Subject: [PATCH 2/5] chore: fix docarray v1v2 terms

Signed-off-by: Han Xiao <han.xiao@jina.ai>
---
 docs/user_guide/representing/array.md | 2 +-
 docs/user_guide/storing/first_step.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/user_guide/representing/array.md b/docs/user_guide/representing/array.md
index c659461d1bd..1d37a73b8a2 100644
--- a/docs/user_guide/representing/array.md
+++ b/docs/user_guide/representing/array.md
@@ -1,6 +1,6 @@
 # Array of documents
 
-DocArray allows users to represent and manipulate multimodaldata to build AI applications such as neural search and generative AI. 
+DocArray allows users to represent and manipulate multimodal data to build AI applications such as neural search and generative AI. 
 
 As you have seen in the [previous section](array.md), the fundamental building block of DocArray is the [`BaseDoc`][docarray.base_doc.doc.BaseDoc] class which represents a *single* document, a *single* datapoint.
 
diff --git a/docs/user_guide/storing/first_step.md b/docs/user_guide/storing/first_step.md
index 96fe45d6800..e8f7ab80315 100644
--- a/docs/user_guide/storing/first_step.md
+++ b/docs/user_guide/storing/first_step.md
@@ -1,6 +1,6 @@
 # Introduction
 
-In the previous sections we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec] to represent multimodaldata and send it over the wire.
+In the previous sections we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec] to represent multimodal data and send it over the wire.
 In this section we will see how to store and persist this data.
 
 DocArray offers two ways of storing your data, each of which have their own documentation sections:

From 3bed9265c0ff089dcc754118af8b4f3b6664bc98 Mon Sep 17 00:00:00 2001
From: Han Xiao <han.xiao@jina.ai>
Date: Thu, 22 Jun 2023 23:27:18 +0800
Subject: [PATCH 3/5] chore: fix docarray v1v2 terms

Signed-off-by: Han Xiao <han.xiao@jina.ai>
---
 docs/user_guide/sending/first_step.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/user_guide/sending/first_step.md b/docs/user_guide/sending/first_step.md
index 0b80ad9d532..d57d08b4c33 100644
--- a/docs/user_guide/sending/first_step.md
+++ b/docs/user_guide/sending/first_step.md
@@ -1,7 +1,7 @@
 # Introduction
 
 In the representation section we saw how to use [`BaseDoc`][docarray.base_doc.doc.BaseDoc], [`DocList`][docarray.array.doc_list.doc_list.DocList] and [`DocVec`][docarray.array.doc_vec.doc_vec.DocVec]
-to represent multimodaldata. In this section we will see **how to send such data over the wire**.
+to represent multimodal data. In this section we will see **how to send such data over the wire**.
 
 This section is divided into two parts:
 

From 242c5f9d80029e5cfab53e976f2e26b325c15a31 Mon Sep 17 00:00:00 2001
From: Han Xiao <han.xiao@jina.ai>
Date: Thu, 22 Jun 2023 23:31:44 +0800
Subject: [PATCH 4/5] chore: fix docarray v1v2 terms

Signed-off-by: Han Xiao <han.xiao@jina.ai>
---
 README.md | 19 ++++++++++---------
 1 file changed, 10 insertions(+), 9 deletions(-)

diff --git a/README.md b/README.md
index b9c286c5ab4..656c595a871 100644
--- a/README.md
+++ b/README.md
@@ -12,7 +12,7 @@
 <a href="https://discord.gg/WaMp6PVPgR"><img src="https://dcbadge.vercel.app/api/server/WaMp6PVPgR?theme=default-inverted&style=flat-square"></a>
 </p>
 
-> > **Note**
+> **Note**
 > The README you're currently viewing is for DocArray>0.30, which introduces some significant changes from DocArray 0.21. If you wish to continue using the older DocArray <=0.21, ensure you install it via `pip install docarray==0.21`. Refer to its [codebase](https://github.com/docarray/docarray/tree/v0.21.0), [documentation](https://docarray.jina.ai), and [its hot-fixes branch](https://github.com/docarray/docarray/tree/docarray-v1-fixes) for more information.
 
 
@@ -33,16 +33,17 @@ To install DocArray from the CLI, run the following command:
 pip install -U docarray
 ```
 
-> > **Note**
+> **Note**
 > To use DocArray <=0.21, make sure you install via `pip install docarray==0.21` and check out its [codebase](https://github.com/docarray/docarray/tree/v0.21.0) and [docs](https://docarray.jina.ai) and [its hot-fixes branch](https://github.com/docarray/docarray/tree/docarray-v1-fixes).
 
-> :bulb: **New to DocArray?** Depending on your use case and background, there are multiple ways to learn about DocArray:
-> 
-> - [Coming from pure PyTorch or TensorFlow](#coming-from-pytorch)
-> - [Coming from Pydantic](#coming-from-pydantic)
-> - [Coming from FastAPI](#coming-from-fastapi)
-> - [Coming from a vector database](#coming-from-vector-database)
-> - [Coming from Langchain](#coming-from-langchain)
+## Get Started
+New to DocArray? Depending on your use case and background, there are multiple ways to learn about DocArray:
+ 
+- [Coming from pure PyTorch or TensorFlow](#coming-from-pytorch)
+- [Coming from Pydantic](#coming-from-pydantic)
+- [Coming from FastAPI](#coming-from-fastapi)
+- [Coming from a vector database](#coming-from-vector-database)
+- [Coming from Langchain](#coming-from-langchain)
 
 
 ## Represent

From 3ad7089d92d8cedc1350d73d03f87ba7cb675ab6 Mon Sep 17 00:00:00 2001
From: Han Xiao <han.xiao@jina.ai>
Date: Thu, 22 Jun 2023 23:34:24 +0800
Subject: [PATCH 5/5] chore: fix docarray v1v2 terms

Signed-off-by: Han Xiao <han.xiao@jina.ai>
---
 docs/assets/docarray-dark.svg | 2 +-
 mkdocs.yml                    | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/assets/docarray-dark.svg b/docs/assets/docarray-dark.svg
index 7bb9d21c90e..e8c43ac48d4 100644
--- a/docs/assets/docarray-dark.svg
+++ b/docs/assets/docarray-dark.svg
@@ -2,7 +2,7 @@
 <svg width="320px" height="320px" viewBox="0 0 320 320" version="1.1" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
     <title>docarray-dark 2</title>
     <g id="docarray-dark-2" stroke="none" stroke-width="1" fill="none" fill-rule="evenodd">
-        <g id="编组" transform="translate(31.000000, 31.000000)" fill="#FBCB67" fill-rule="nonzero">
+        <g id="编组" transform="translate(31.000000, 31.000000)" fill="#FFFFFF" fill-rule="nonzero">
             <g id="编组-57" transform="translate(129.000000, 129.000000) scale(-1, 1) translate(-129.000000, -129.000000) ">
                 <path d="M155,0.0045310358 C160.42678,0.0045310358 164.84543,4.32478085 164.996032,9.71569372 L165,10 C165,15.5228475 160.522847,20 155,20 L129,20 C69.9989231,20 21.9467921,66.8780096 20.0576823,125.419953 L20.0146029,127.197486 L20,129 C20,188.597047 67.8298231,237.022841 127.197486,237.985397 L155,237.996137 C160.523473,237.998271 165,242.476528 165,248 C165,253.522847 160.522847,258 155,258 L129,258 C58.4677146,258 1.15645429,201.394063 0.0172823144,131.133251 L0,129 C0,58.4677146 56.6059375,1.15645429 126.866749,0.0172823144 L155,0.0045310358 Z M240,0 C249.830669,0 257.820682,7.88078333 257.997024,17.6693348 L258,18 L258,248 C258,253.42924 253.673329,257.847932 248.279905,257.996158 L248,258 L204,258 C198.477153,258 194,253.522847 194,248 C194,242.477153 198.477153,238 204,238 L238,238 L238,20 L204,20 C198.477153,20 194,15.5228475 194,10 C194,4.4771525 198.477153,0 204,0 L240,0 Z M151.5,126 C172.210678,126 189,142.789322 189,163.5 C189,184.210678 172.210678,201 151.5,201 C130.789322,201 114,184.210678 114,163.5 C114,161.329305 114.184434,159.201688 114.538488,157.131962 L114.679179,156.358558 L124.117669,156.35849 C126.796188,156.35849 129.425849,155.641266 131.733454,154.281331 C138.772774,150.132866 141.180601,141.130787 137.209662,134.03715 L137.04051,133.742705 L134.788887,129.920123 C139.820587,127.411237 145.495487,126 151.5,126 Z M89.7549106,88.0242111 C91.100011,85.6125382 94.1454775,84.7479105 96.5571504,86.0930109 C97.2462577,86.4773576 97.8333702,87.0186169 98.2718658,87.6712617 L98.4292442,87.9211357 L128.425282,138.819895 C129.82731,141.198924 129.035293,144.264076 126.656264,145.666104 C125.972529,146.069047 125.203973,146.302717 124.414554,146.349669 L124.117669,146.35849 L65.7330609,146.35849 C62.9716371,146.35849 60.7330609,144.119914 60.7330609,141.35849 C60.7330609,140.600665 60.9053061,139.854027 61.2352488,139.174527 L61.3663409,138.92297 L89.7549106,88.0242111 Z M200.311688,65 C201.909369,65 203.215349,66.24892 203.306596,67.8237272 L203.311688,68 L203.311688,127 C203.311688,128.597681 202.062768,129.903661 200.488006,129.994907 L200.311738,130 L185.175,129.999 L184.696417,129.525108 C176.194189,121.212238 164.852133,116.319831 152.707395,116.015135 L152.099498,116.003729 L151.5,116 C150.380425,116 149.068887,116.085584 147.565386,116.256751 L147.05711,116.316977 L146.178383,116.431441 L145.260158,116.563515 L144.786234,116.636155 L143.808762,116.794644 L142.791791,116.970742 L142.268493,117.065395 L141.192274,117.267908 L140.639353,117.375768 L139.503887,117.604696 L138.311,117.855 L138.311688,68 C138.311688,66.4614925 139.469809,65.1934785 140.961825,65.0201832 L141.135416,65.0050927 L141.311688,65 L200.311688,65 Z" id="形状"></path>
             </g>
diff --git a/mkdocs.yml b/mkdocs.yml
index 73d789e60b4..a7ad4cf8500 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -5,7 +5,7 @@ repo_name: docarray/docarray
 repo_url: https://github.com/docarray/docarray
 edit_uri: ''
 theme:
-  logo: assets/docarray-colorful.svg
+  logo: assets/docarray-dark.svg
 
   favicon: assets/favicon.png
   name: material