Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

July 31, 2015

Robotic Article Tagging (OpenOffice Idea)

Filed under: OpenOffice,Tagging — Patrick Durusau @ 9:00 pm

The New York Times built a robot to help make article tagging easier by Justin Ellis.

From the post:

If you write online, you know that a final, tedious part of the process is adding tags to your story before sending it out to the wider world.

Tags and keywords in articles help readers dig deeper into related stories and topics, and give search audiences another way to discover stories. A Nieman Lab reader could go down a rabbit hole of tags, finding all our stories mentioning Snapchat, Nick Denton, or Mystery Science Theater 3000.

Those tags can also help newsrooms create new products and find inventive ways of collecting content. That’s one reason The New York Times Research and Development lab is experimenting with a new tool that automates the tagging process using machine learning — and does it in real time.

The Times R&D Editor tool analyzes text as it’s written and suggests tags along the way, in much the way that spell-check tools highlight misspelled words:

Great post but why not take the “…in much the way that spell-check tools highlight misspelled words” just a step further?

Apache OpenOffice already has spell-checking, so why not improve it to have automatic tagging?

You may or may not know that Open Document Format (ODF) 1.2 was just published as an ISO standard!

Which is the format used by Apache OpenOffice.

Open Document Format (ODF) 1.2 supports RDFa for inline metadata.

Now, imagine for a moment using standard office suite software (Apache OpenOffice) to choose a metadata dictionary and have your content automatically tagged as you type or to insert a document and tags are automatically inserted into the text.

Does that sound like a killer application for your corner of the woods?

A universal dictionary of RDFa tags might be a real memory hog but how many different tags would you need day to day? That’s even an empirical question that could be answered by indexing your documents for the past six (6) months.

With very little effort on the part of users, you can transform your documents from unstructured text to tagged (and proofed) text.

Assemble at the Apache OpenOffice (or LibreOffice) projects if an easy-to-use, easy-to-modify tagging system for office suite software appeals to you.

For other software projects supporting ODF, see: OpenDocument software.

PS: Work is current underway at the ODF TC (OASIS) on robust change tracking support. All we are missing is you.

July 23, 2013

Apache OpenOffice 4.0 Released!

Filed under: OpenOffice — Patrick Durusau @ 8:45 am

Apache OpenOffice 4.0 Released!

From the release notes:

Apache OpenOffice 4.0 is now available for download from our official download page. Building upon the great success of the OpenOffice 3.4 release, which has seen over 57 million downloads, this major update brings exciting new features, enhancements and bug fixes. OpenOffice 4.0 features an innovative new Sidebar user interface, additional language support for 22 languages (including 3 new languages), 500 bug fixes, improvements in Microsoft Office interoperability, enhancements to drawing/graphics, performance improvements, etc. You can read the details of these later in these Release Notes.

One other thing that you will notice immediately, on this page, our website and in other places, is our new logo. The new logo is the result of many design iterations, collaboration among talented designers and a contest where 40 logos were rated by over 50,000 users.

We wish to give a special “hello” to our upgrading users. We hope you like the new features, especially the Sidebar UI. This feature was based on code contributed by IBM, from their Lotus Symphony product. We adapted that code, and based on several rounds of feedback from users like you, improved it and integrated it into OpenOffice.

And for new users, welcome to the community! We hope you enjoy using Apache OpenOffice, the leading free and open office productivity suite.

As the OpenDocument Format (ODF) editor, I must confess a bias in favor of applications that use ODF. 😉

I am downloading a copy of Apache OpenOffice 4.0 in the background as I write this post.

You should too!

BTW, OpenDocument Format 1.3 (draft) is underway. If you have specific features or needs, contributing expertise (not just your request) to the ODF Technical Committee at OASIS will move applications such as Apache OpenOffice towards meeting your needs.

November 15, 2012

OpenOffice, RDF and Graphs

Filed under: Graphs,OpenOffice,RDF — Patrick Durusau @ 5:41 pm

Creating a virtual RDF graph describing a set of OpenOffice spreadsheets with Apache Jena and Fuseki

by Pierre Lindenbaum

From the post:

In the current post, I will use the Jena API for RDF to implement a virtual RDF graph describing the content of a set of openoffice/libreoffice spreasheets.

Now this is truly an intersection of interests for me!

BTW, ODF 1.2 has support for RDF based metadata. There hasn’t been a lot of implementation activity on it but that could change.

Particularly since the right application could allow users to do an “ontology/entity” check, just as you do a spell check now.

If you want to know what something means, why not ask the author?

I will be forwarding this link to the ODF crowd as we work towards ODF 1.3.

Highly recommended.

Powered by WordPress