Close Search

Type your search terms above and press return to see the search results.

Saskia Vola

Textmining, NLP and Elasticsearch consulting

Menu
  • Home
  • Services
  • Technology
  • Projects
  • About
  • Contact
Search
2021-11-22
comment 0

Information Extraction with Elasticsearch – How to enrich your data

We know that data is the new gold. The quality of data is super important and can really make a difference for any website or online business. Nowadays many online businesses aggregate data from various sources and build a search engine for it. They earn… Read more

Filed under: Elasticsearch, Textmining
2019-06-04
comment 0

How to speed up indexing into Elasticsearch

There are in general 2 different scenarios when it comes to indexing. Either you have to deal with a stream of data, like logs, Twitter Stream, newsfeeds etc. or you have nightly database dumps. There might be cases where you have both nightly database dumps… Read more

Filed under: Elasticsearch
2019-04-23
comment 1

Named Entity Annotations in Elasticsearch

This blogpost will show how you can use Elasticsearch to extract Named Entities and store them as annotations. There is a really nice plugin written by one of the main Elasticsearch developers Alexander Reelsen: https://github.com/spinscale/elasticsearch-ingest-opennlp This plugin wraps the library OpenNLP and allows to extract… Read more

Filed under: Elasticsearch, Textmining
2019-04-15
comment 0

When simple is better: The boolean similarity module

I had a lecture about Information Retrieval at university. That’s the field that studies search engines. In the first few classes we learned about the history and evolution of language models that are used for search engines. The most basic and simple form of a… Read more

Filed under: Elasticsearch, Ranking
2019-04-01
comments 2

How to build a self-learning search engine with Elasticsearch

This blogpost will walk you through a demo that shows how you can use Elasticsearch to build a self-learning search engine. You can apply this technique if you have a user facing UI and if you can access the webanalytics that tracks the user-interaction with… Read more

Filed under: Elasticsearch, Ranking
2018-08-15
comment 0

Test-Driven Relevance Tuning of Elasticsearch using the Ranking Evaluation API

This blog post is written for engineers that are always looking for ways to improve the result sets of their search application built on Elasticsearch. The goal of this post is to raise awareness of why you should care about relevance, what components are involved… Read more

Filed under: Elasticsearch, Ranking
2017-05-24
comment 0

How to use ElasticSearch for Natural Language Processing and Text Mining — Part 2

Welcome to Part 2 of How to use Elasticsearch for Natural Language Processing and Text Mining. It’s been some time since Part 1, so you might want to brush up on the basics before getting started. This time we’ll focus on one very important type… Read more

Filed under: Elasticsearch, Textmining
2017-05-02
comment 0

Text Classification made easy with Elasticsearch

Elasticsearch is widely used as a search and analytics engine. Its capabilities as a text mining API are not as well known. In the following article I’d like to show how text classification can be done with Elasticsearch. With a background in computational linguistics and… Read more

Filed under: Elasticsearch, Textmining
2016-12-30
comment 0

How to use ElasticSearch for Natural Language Processing and Text Mining — Part 1

ElasticSearch is a search engine and an analytics platform. But it offers many features that are useful for standard Natural Language Processing and Text Mining tasks. Read more…

Filed under: Elasticsearch, Textmining
2016-12-02
comment 0

Why I decided to become a freelancer

Why I decided to become a freelancer Back in early 2014 I just had quit my job at a startup because they had pivoted from semantic technologies to something that I couldn’t relate to anymore. So I applied for new jobs at larger companies. I… Read more

Filed under: Uncategorized
« Older Posts
  • GitHub
  • LinkedIn
  • Xing
    © 2019 Saskia Vola | Imprint