DatabaseLink and Libreoffice odb




In scientific projects, I often deal with large Excel files from partners as this is unfortunately still the way to store data. Always, one of my first steps after importing the data is to verify and check the consistency of all entries.

That being said, for projects that are under my control I like to make this more consistent by offering a small database with fixed types and relations. LibreOffice is probably an easy way to give me the opportunity to access consistent datasets and ensuring my colleagues can use a simple Excel-like form to insert datasets. To make this happen, I need to be able to open an LibreOffice database with Mathematica.


Having a libreoffice database, which should be an HSQLDB, how is it possible to use DatabaseLink` to connect to it.

The documentation indicates that I should use something like this

conn = OpenSQLConnection[
  JDBC["HSQL(Standalone)", "/path/to/file.odb"]

which creates an SQLConnection successfully, but SQLTableNames just gives {}. Am I doing something wrong?

Btw, my system is Linux 64bit with Mathematica 11.0.1 if that matters.


Posted 2017-02-17T03:00:37.563

Reputation: 109 574

halirutan, this may be an idiot question, but does the conn status shows "Open"? Another question: does SQLExecute[conn, "show tables"] also return "{ }"? – Rod – 2017-02-23T04:11:17.360

1@Rod Yes and yes. I get no error message at all and it seems everything works, but my db just has no tables (of course it has tables, but Mathematica won't show them). – halirutan – 2017-02-23T04:24:02.653

another (maybe) stupid question... why don't you try MySQL (or even MariaDB), for instance? I use MMA+MySQL on a daily basis and I have no problem at all with the integration between them... – Rod – 2017-02-23T04:26:20.410

1@Rod Because for a LibreOffice DB, my colleagues don't need to know anything more than they actually do. Creating simple "Forms" and "Reports" is easy and they can almost work like they did before. – halirutan – 2017-02-23T04:28:04.270

it's ok then... I'm really sorry I can't help you in this question. I really hope someone can help you with your bounty! +1 – Rod – 2017-02-23T04:33:10.477

1@Rod Actually, I tried SQLExecute[conn, "show tables"] and it gives $Failed (JDBC: unexpected token: show). But the connection is open. I just saw that it indeed seems to open the database because there are several temp files that appear with the extension lck, log, properties, script... – halirutan – 2017-02-23T04:39:28.243


not possible I think. odb files are zip files. see here. However, with MDB on Windows there is no problem. So you might want to use Windows ...

– Rolf Mertig – 2017-02-23T08:35:56.570

@RolfMertig That is truly sad. I looked through all the database drivers Mathematica supports and saw "MS Access" as well. – halirutan – 2017-02-27T03:33:29.810

@halirutan Yes. Sad that Libreoffice is not as good as it might be. And Linux is also not as good as it might be. But after your Trauerarbeit you could just go ahead and do use Windows ( and Mathematica etc. on it). It is not that bad, really. – Rolf Mertig – 2017-02-27T11:16:29.443

@halirutan Have you tried renaming the .odb file to .zip and extracting the contents. See If you are successful and need to modify and save see seems like someone put together an R package for rudimentary access Maybe someone will create a package for Mathematica

– Hans – 2017-02-28T18:58:41.827

@Hans Unfortunately, no success. The java code in the blog-post uses the same driver as Mathematica does: "org.hsqldb.jdbcDriver" and the protocol "jdbc:hsqldb:file:" seems to match as well. Still, I don't get any tables out of my database. – halirutan – 2017-03-02T05:24:49.410



A few tricks are required, but it is possible to create a database in LibreOffice and read the data via DatabaseLink in MMA. Here is a summary of worked for me with MMA 11.0, LibreOffice 5.0 and GNU/Linux.

Database Creation

The trick in creating the database is to use all capital letters for table names and column names. When I used lower case names, MMA could see the names, but I could not access the data with the SQL command I was using. Later, I will describe how to access data in tables with mixed case identifiers. However, if you can use upper case from the beginning, everything will be easier.

From the LibreOffice menu, I selected Insert->Table Design... to create a new database with one table. The table had 3 columns for integer, varchar and float data. I entered 3 rows of data, saved the table, and quit LibreOffice. It is probably a good idea to quit LibreOffice entirely.

The database filename is "db_137967.odb". I tried to open it in MMA, but could not get that to work. All MMA wanted to do was to create a new database, using "db_137967.odb" as the database name.

Database Extraction

The next trick is to extract the database files from the .odb file, which is simply a zip archive. Working at the command line, I created a scratch directory, moved db_137967.odb to the directory, and extracted the database files using the GNU unzip utility. No options required, just enter unzip db_137967.odb at the prompt to extract everything. Or if you want unzip db_137967.odb database/* will extract only the good stuff.

Unzipping the archive creates several files and directories. The five important files are in a directory named "database". I assumed all five of these files are required. I renamed (mv command) "log" to db_137697.log, renamed "data" to, and likewise for "properties", "backup" and "script". I moved these 5 files to the directory with my MMA notebooks. I noticed that unzipping the archive does not remove the "db_137967.odb" file, so LibreOffice can still read it.

I tried to unzip the .odb archive in MMA using Import[], but made no progress. It would be nice to have a MMA function that did this extraction.

It should be noted that it may be possible to create a conflict between the database name and the notebook name. My notebook name is "mma_137967.nb" and when I used the same name for the "log", "data", "properties", etc file, I had problems. So, I changed the database filenames to "da_137967", and the problems disappeared. Once it started working with different names, I did not try to re-create conflict.

Database Connection

Finally, we get to the MMA part. We have quit LibreOffice, we have extracted and renamed the 5 database files, and we are ready to read the data into MMA. Here's the MMA code to make the connection:

conn = OpenSQLConnection[
    ToFileName[{NotebookDirectory[]}, "db_137967"]

info = Transpose@SQLConnectionInformation[conn];

(*  {170, 2}  *)

A few things to note in the above code: first, don't forget the Needs[] statement. Second, we do not really give a filename, just the "db_137967" part. MMA will open the files it needs. In this example all 5 of the files were in my NotebookDirectory[]. After opening the connection, I read the info, made sure that I got 170 rows and 2 columns. Note the use of Transpose in the above.

Later, I used the Dataset command to obtain a scrollable view of the info. Of the 170 entries in info, the ones that relate to "identifiers" appear to be especially relevant. They can be extracted with

  StringContainsQ[#[[1]], ___ ~~ "Identifier" ~~ ___] &]


IdentifierQuoteString               "
StoresLowerCaseIdentifiers          False
StoresLowerCaseQuotedIdentifiers    False
StoresMixedCaseIdentifiers          False
StoresMixedCaseQuotedIdentifiers    False
StoresUpperCaseIdentifiers          True
StoresUpperCaseQuotedIdentifiers    False
SupportsMixedCaseIdentifiers        False
SupportsMixedCaseQuotedIdentifiers  True


From the above, one might suspect the database stores only the uppercase table names and column names. Likewise for names of views and of other database structures. However, these values also hint that mixed case names could be used, if they are quoted with the double quote character. As it turns out, lower case and mixed case identifiers are allowed, as we shall see.

Database Table Access

Here are the MMA commands that accessed the table data. The first command lists the table names.

SQLExecute[conn,"SELECT COUNT(*) FROM PROJECTS"]//TableForm



2   Cold Fusion          3.14159
3   Negative Gravity    -1.12358*10^7
1   Transmutation        1234.57

The second command lists the table name / column name pairs. The third command gives a count of the number of entries in the PROJECTS table. The fourth command displays up to 10 of the data entries in the table in alphabetical order by project name.

Mixed Case Identifiers

I went back to LibreOffice and added some tables that had lower case and mixed case table names and column names. I extracted the 5 database files from the .odb file, as described above. Then, in MMA, I created a connection, exactly as above, and got the following results:

TableForm@SQLColumnNames[conn, "TestResults"]
quote = Last@FirstCase[info, {"IdentifierQuoteString", _}];
sqlcmd = StringJoin[{"SELECT COUNT(*) FROM ", quote,
   "TestResults", quote}]
TableForm@SQLExecute[conn, sqlcmd]
sqlcmd = StringJoin[{"SELECT TOP 10 * FROM ",
   quote, "TestResults", quote,
   " ORDER BY ", quote, "ColumnNo2", quote}]
TableForm@SQLExecute[conn, sqlcmd]

TestResults ColumnNo1
TestResults ColumnNo2
TestResults ColumnNo3


123 data-123    123.4
987 data-987    987.6

The first SQLColumnNames[] lists the 3 table name / column name pairs. The table name for this query is "TestResults" and MMA recognizes the mixed case identifier. To access the data, we must put the mixed case identifiers in quotes.

The second command selects the appropriate quote character from the info variable, which is computed in the first section of code above.

The third command is a StringJoin[] that assembles an SQL command with quote characters on the table name. A simpler way to do this would be to embed \" in the string, but I didn't think of that until later. The fourth command in another SQLExecute[], with the result that there are 2 rows of data in the table.

The fifth command is another StringJoin[] that creates a second SQL command to access the data. In this second SQL the quotes are on both the table name and the column name. The column name is "ColumnNo2". The final SQLExecute[] executes the SQL command, which produces two lines of data from the table.

This same approach worked on a database table that had all lower case table names and column names. Because we must put that quote character on our mixed case identifiers, this method is a little messy. The simplest thing that works is to use all upper case identifiers, if possible.

Database Cleanup

When I closed the connection, I took a look at the file names that were left. I noticed MMA left behind one "tmp" directory and an "lck" file. Here's the MMA code that closed the database connection and showed me the new file and the new directory:

ColumnForm[FileNameTake[#, -1] & /@
     FileNames["db_137967*", NotebookDirectory[]]

 db_137967.tmp  *)

So, that is about it. Some of it is a bit messy, but a shell script could greatly simplify the database extraction process. It would be nice to know how well this approach works with large data sets. Does the need to unzip the archive become an impediment?


Posted 2017-02-17T03:00:37.563

Reputation: 8 428

1The key that was missing is that table names need to be upper-case. Great finding. I could reproduce your solution on an example DB. I put another bounty for you because of the nicely written answer. Thank you very much. – halirutan – 2017-03-19T11:44:52.370

Thank you very much from me as well! (Can confirm works exactly as you describe with Mma 11, LibreOffice/ Xubuntu-16, and that table names need to be UC (found out the hard way). In addition, on my system, LibreOffice Base was not installed by default

– user1066 – 2017-03-19T19:07:44.100

Strangely(?), the SQLExecute command is case-insensitive as long as the table name is UC. SQLExecute[conn, "SELECT * FROM myProDucTs"] works as long as tname is MYPRODUCTS. In addition, changing a lowercase table name to uppercase in the your_db_name.script file (two entries) and starting a fresh Mathematica kernel works for me. (The table name may be changed to anything as long as it it UC). – user1066 – 2017-03-21T14:21:41.577