CFP: Bug hunting with Apache Lucene

Uwe Schindler uschindler at apache.org
Sat Dec 12 13:11:24 CET 2015


Title: Bug hunting with Apache Lucene

Abstract:
When Oracle released Java 7 GA, it contained a serious loop optimization bug that affected Apache projects such as Lucene, Solr, and Elasticsearch. Since then, the Apache Lucene team has been working together with the Oracle and OpenJDK developers to test early-access releases against Apache Lucene's codebase. Because Apache Lucene contains heavy, low-level code with many tight loops and unusual code patterns, it is an ideal testing environment for the HotSpot optimizer. Loops, iterators, and bitset operations are executed millions of times per full-text query to run finite-state automata, collect and filter results, and sort them. Bugs caused by miscompilation that are hard to reproduce in isolation can be easily reproduced with Lucene. In addition, Lucene uses a pseudo-randomized testing infrastructure that triggers different HotSpot optimizations on each test execution. In this talk, Uwe will show the current state of Java 9 EA testing. He will also present the problems caused by hard-to-reproduce issues, such as those that occurred after the changes to arraycopy or the AVX optimizations in HotSpot. [Optional: As a short excursion, he will also present typical changes that had to be made to make Lucene's Java code work with Jigsaw.]

Bio:
Uwe is a committer and PMC member of Apache Lucene and Solr. His main focus is on the development of Lucene Java. He implemented fast numerical search and maintains the new attribute-based text analysis API. After the release of Java 7 in 2011, he started to intensify communication between the Apache Lucene team and Oracle and OpenJDK team members, and he introduced the Java EA testing, which runs on his company's servers. He studied Physics at the University of Erlangen-Nuremberg and works as managing director of SD DataSolutions GmbH in Bremen, Germany, a company that provides consulting and support for Apache Lucene, Elasticsearch, and Apache Solr. He also works for “PANGAEA – Publishing Network for Geoscientific & Environmental Data”, where he implemented the portal's geo-spatial retrieval functions with Lucene Java. Uwe has given talks about Lucene at various international conferences such as the previous FOSDEM, Berlin Buzzwords, ApacheCon EU/US, Lucene Revolution, Lucene Eurocon, and at various local meetups.

Audio/video recording and streaming are acceptable under a CC-BY license. I am fine with either a short or a long track, because the talk can be made as detailed as time allows :-)

-----
Uwe Schindler
uschindler at apache.org 
ASF Member, Apache Lucene PMC / Committer
Bremen, Germany
http://lucene.apache.org/



