Comments for ArangoDB https://arangodb.com/ The database for graph and beyond Fri, 06 Oct 2023 06:04:54 +0000
Comment on A story of a memory leak in GO: How to properly use time.After() by Laura Cope https://arangodb.com/2020/09/a-story-of-a-memory-leak-in-go-how-to-properly-use-time-after/#comment-31 Fri, 31 Mar 2023 13:23:50 +0000 https://arangodb.com/?p=35186#comment-31 In reply to Todd.

yes, the below line:
go func() { <-timeout }() // prevent leak
could also work, but it creates a separate goroutine that only finishes once the timeout elapses.
With a long timeout, that goroutine stays alive for the whole duration, which is not good for performance. It is better to stop the timer as soon as we know that it is no longer required.

Comment on A story of a memory leak in GO: How to properly use time.After() by Todd https://arangodb.com/2020/09/a-story-of-a-memory-leak-in-go-how-to-properly-use-time-after/#comment-30 Thu, 16 Feb 2023 20:47:05 +0000 https://arangodb.com/?p=35186#comment-30 Couldn’t you also do this?

```
timeout := time.After(time.Second)
select {
case <-timeout:
	// do something after 1 second.
case <-ctx.Done():
	go func() { <-timeout }() // prevent leak
	// do something when context is finished.
}
```

Comment on Introducing the new ArangoDB Datasource for Apache Spark by Michele Rastelli https://arangodb.com/2022/03/introducing-the-new-arangodb-datasource-for-apache-spark/#comment-11 Thu, 21 Jul 2022 13:13:33 +0000 https://www.arangodb.com/?p=39584#comment-11 In reply to quanns.

You can find a working PySpark demo at: https://github.com/arangodb/arangodb-spark-datasource/tree/main/demo#pythonpyspark-demo

Comment on Introducing the new ArangoDB Datasource for Apache Spark by quanns https://arangodb.com/2022/03/introducing-the-new-arangodb-datasource-for-apache-spark/#comment-10 Mon, 20 Jun 2022 06:36:25 +0000 https://www.arangodb.com/?p=39584#comment-10 Hi Rastelli,
Does this driver support PySpark? I tried to use it with PySpark and it doesn’t work. I cannot find any documentation on integrating the arango-spark library with PySpark.

Comment on Word Embeddings in ArangoDB by Alex Geenen https://arangodb.com/2021/06/word-embeddings-in-arangodb/#comment-19 Tue, 06 Jul 2021 11:44:38 +0000 https://www.arangodb.com/?p=37375#comment-19 In reply to Fabio Mencoboni.

Hi Fabio,

> If I understand correctly, this approach is using the DistilBERT model in Python to calculate embeddings for documents which are then stored in ArangoDB.

Yes, that’s correct!

> I have seen elsewhere the use of ArangoSearch, which I think did tokenization and embedding directly in the database. Do I understand the difference between these approaches correctly?

Yes, ArangoSearch allows you to perform tokenization and full-text search directly in the database. At this point, word embeddings aren’t directly supported, which is what this tutorial lets you do. ArangoSearch does support vector space models such as BM25 and TF-IDF for scoring search results. Please see here if you want to learn more about them.

> The query uses the expression below to calculate the dot-product of the query embedding to document embedding. This implies a slower single-thread approach, though if ArangoDB is calculating this value for multiple documents concurrently under the hood it would still get the benefit of multi-core processors. Any thoughts/comments on performance?

Great question! The answer is that it depends. If you’re querying a single server, it will use a sequential scan (so a single thread). If you’re querying a collection on a cluster, and the collection is sharded across different servers, then there will be concurrency at a database server level, but within those server processes it will also be scanned sequentially.

Comment on Word Embeddings in ArangoDB by Fabio Mencoboni https://arangodb.com/2021/06/word-embeddings-in-arangodb/#comment-18 Fri, 02 Jul 2021 12:24:49 +0000 https://www.arangodb.com/?p=37375#comment-18 Very cool tutorial- thanks for sharing. I am really excited about using ArangoDB with Semantic queries, and this is a great overview. A couple questions:
* If I understand correctly, this approach is using the DistilBERT model in Python to calculate embeddings for documents which are then stored in ArangoDB.
* I have seen elsewhere the use of ArangoSearch, which I think did tokenization and embedding directly in the database. Do I understand the difference between these approaches correctly?
* The query uses the expression below to calculate the dot-product of the query embedding to document embedding. This implies a slower single-thread approach, though if ArangoDB is calculating this value for multiple documents concurrently under the hood it would still get the benefit of multi-core processors. Any thoughts/comments on performance?
```
LET numerator = (SUM(
  FOR i IN RANGE(0, 767)
    RETURN TO_NUMBER(NTH(descr_emb, i)) * TO_NUMBER(NTH(v.word_emb, i))
))
```

Comment on Introducing Developer Deployments on ArangoDB ArangoGraph by Ewout Prangsma https://arangodb.com/2021/06/introducing-developer-deployments-on-arangodb-oasis/#comment-17 Wed, 23 Jun 2021 09:10:41 +0000 https://www.arangodb.com/?p=37257#comment-17 In reply to Rishav Sharan.

Hi Rishav,

Thanks for your comment.
We’ve chosen to offer a free-to-try deployment that has enough resources to let you really try out all of the features of ArangoDB and Oasis.
It is free for 14 days.

The Developer deployments are aimed at individual developers.
If you need a fully free option, you can run the ArangoDB database in a docker container on your laptop or any small VPS.

Ewout

Comment on Introducing Developer Deployments on ArangoDB ArangoGraph by Rishav Sharan https://arangodb.com/2021/06/introducing-developer-deployments-on-arangodb-oasis/#comment-16 Tue, 22 Jun 2021 07:30:20 +0000 https://www.arangodb.com/?p=37257#comment-16 You folks really need to add in a free tier for developers who want to learn ArangoDB or want to deploy small pet projects.

All other DBaaS providers have a free tier and frankly, as a developer somewhat interested in trying Arango cloud, I don’t see any reason to try it out if I can’t tinker around with it for my pet projects.

This developer account is welcome, but is not what we really want.

Comment on Auto-Generate GraphQL for ArangoDB by Laura Cope https://arangodb.com/2017/10/auto-generate-graphql-arangodb/#comment-138 Fri, 18 Jun 2021 13:43:10 +0000 https://www.arangodb.com/?p=23212#comment-138 In reply to raquel.

Hi! Thanks for the response. The following page is more up to date and can help answer your question: https://www.arangodb.com/docs/stable/foxx-reference-modules-graph-ql.html

Comment on Auto-Generate GraphQL for ArangoDB by raquel https://arangodb.com/2017/10/auto-generate-graphql-arangodb/#comment-137 Thu, 17 Jun 2021 06:00:21 +0000 https://www.arangodb.com/?p=23212#comment-137 Will this work for Mutation?
