talk@lists.collectionspace.org

WE HAVE SUNSET THIS LISTSERV - Join us at collectionspace@lyrasislists.org

Elasticsearch indices

Jesse Martinez
Thu, Mar 31, 2016 1:37 AM

Hi all,

I've set up an instance of CSpace v4.3 along with Elasticsearch on the same
server. I see that CSpace automatically creates indices in ES based on
tenant binding settings. From what I gather, only newly created or edited
records get added to the ES instance. Is there a way to trigger a complete
indexing of all CSpace records? Or, does anyone have a handy script that
pulls our record data from CSpace and adds it to ES?

Also, is/will there be granular control over what gets indexed? One idea is
to expose ES for report creation (either through Jasper or another
ES-supported service) but I'm cautious about access to sensitive content.

Thanks!

Jesse

Ray Lee
Thu, Mar 31, 2016 8:19 AM

Aaagh, I'm jealous, I've been eager to try out the ES support. But I
haven't had time, so I can't help. Let us know what you find!

Ray


Talk mailing list
Talk@lists.collectionspace.org

http://lists.collectionspace.org/mailman/listinfo/talk_lists.collectionspace.org

Richard Millet
Fri, Apr 1, 2016 6:14 PM

Jesse,

I'll be adding a "reindex" API to the next major release. I'm adding support to reindex both the Elasticsearch index and the PostgreSQL full-text index. The API will look something like this:

For PostgreSQL:

Reindex everything: POST /index/fulltext

Reindex all docs of a certain type: POST /{resource-type}/index/fulltext

Reindex a specific document: POST /{resource-type}/{CSID}/index/fulltext

For Elasticsearch:

Reindex everything: POST /index/esi

Reindex all docs of a certain type: POST /{resource-type}/index/esi

Reindex a specific document: POST /{resource-type}/{CSID}/index/esi
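As a rough illustration of how a client might address the endpoints listed above, here is a minimal Python sketch. It only builds the paths and prepares the POST requests; nothing is sent. The endpoints are the proposal as described in this message, not a released API, and the function names, the base URL, and the `collectionobjects` resource type in the usage note are assumptions made for the example.

```python
from urllib.parse import quote
from urllib.request import Request

# Index kinds taken from the proposal above:
# "fulltext" (PostgreSQL full-text index) and "esi" (Elasticsearch index).
INDEX_KINDS = ("fulltext", "esi")


def reindex_path(kind, resource_type=None, csid=None):
    """Build the path of a proposed reindex endpoint (hypothetical helper)."""
    if kind not in INDEX_KINDS:
        raise ValueError(f"unknown index kind: {kind!r}")
    if csid is not None and resource_type is None:
        raise ValueError("a CSID requires a resource type")
    # Optional leading segments: /{resource-type} and /{resource-type}/{CSID}
    parts = [quote(p, safe="") for p in (resource_type, csid) if p is not None]
    parts += ["index", kind]
    return "/" + "/".join(parts)


def reindex_request(base_url, kind, resource_type=None, csid=None):
    """Prepare, but do not send, the POST request for a reindex call."""
    url = base_url.rstrip("/") + reindex_path(kind, resource_type, csid)
    return Request(url, data=b"", method="POST")
```

For example, `reindex_request("http://localhost:8180/cspace-services", "esi", "collectionobjects")` would prepare a POST to `/collectionobjects/index/esi` under that assumed base URL. Actually sending it (for instance with `urllib.request.urlopen`) would also need whatever authentication the deployment requires, which is not covered here.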

In the meantime, you could try creating a Nuxeo plugin to reindex the existing data into Elasticsearch using this code as an example:

https://github.com/nuxeo/nuxeo/blob/afd1d6c026b3b65c8ce07788bd874b183bc20e4f/nuxeo-features/nuxeo-elasticsearch/nuxeo-elasticsearch-automation/src/main/java/org/nuxeo/ecm/automation/elasticsearch/Indexing.java

That code has been available since HF25.

To see examples of existing Nuxeo plugins take a look here:

https://github.com/collectionspace/services/tree/master/3rdparty/nuxeo/nuxeo-platform-listener

-Richard


John B Lowe
Wed, Sep 21, 2016 2:59 AM

Did anything come of this? Does the proposed "reindex API" work?

Is anyone using the Elasticsearch capabilities yet?

Just wondering,

John
