Archive for the ‘Typography’ Category

Big Data and the Topologist

Monday, May 6th, 2013

Big Data and the Topologist by Jesse Johnson.

From the post:

As I mentioned in my previous post, I plan to write a series of posts about study of large data sets, both the ways that high dimensional data has traditionally been studied and the topology that has recently been applied to this area. For anyone who has experience thinking about abstract geometric objects (as I assume most of the readers of this blog do) the concepts should seem pretty straightforward, and the difficulty is mostly in translation. So I will start with a post that focusses on defining terms. (Update: I’ve started a second blog The Shape of Data to look into these topics in more detail.)

You can also skip ahead to later posts in this series:

  1. The geometry of neural networks
  2. Principle component analysis
  3. Making linear data algorithms less linear: kernels
  4. Topological exploration of data sets: persistent homology
  5. Refining information from persistent homology
  6. Finding clusters in data
  7. Topological Clustering
  8. Digital spheres and Reeb graphs

The study of large data sets in the abstract generally goes by two names: Data mining is the field that grew out of statistics, which considers ways to organize and summarize high dimensional data so that it can be understood by humans. Machine Learning is the subfield of computer science (particularly artificial intelligence) that looks for ways to have computers organize and summarize data, with the goal of having the computer make decisions. These two fields have a lot in common and I will not try to distinguish between them. There are also names for the application of these methods in different sciences, such as Bioinformatics and Cheminformatics. There are also rather notorious applications in marketing, which allow stores to know what you’re going to buy before you know.

We are given a collection of data, usually a set of ordered n-tuples, that come from a science experiment or surveys, or the data that retailers collect about you every time you use your credit card, etc. Some of the entries can be thought of as labels – the code number for the particular experiment, for example. The remaining coordinates/dimensions are often called features. If these features are numerical, then we can think of them as defining vectors in a Euclidean space, and this gives us our first glimpse of geometry. However, for high dimensional data, the Euclidean metric turns out to be problematic, so we will often want to use a different metric. The Euclidean metric is also problematic for binary features such as the presence of different genes in an organism.

This is a nice way to start the week!

Fuse: From Invention to Antimatter

Wednesday, August 1st, 2012

Fuse: From Invention to Antimatter (Amazon Link)

James Cheshire has his brother review “Fuse: From Invention to Antimatter,” under the blog title: Book Review: Fuse.

From the post:

“In a world of generic mediocrity and corporate obeyance, new flowers of exuberance bloom in dark crevices. FUSE is a breach in the wall, a genetic mutation from which new lifeforms can spring [...] Never before has FUSE been so relevant and so necessary.”

The words of Neville Brody open FUSE 1-20, From Invention to Antimatter: 20 years of FUSE with the air of positive aggression and idealism that continues throughout the book. Across twenty editions (since 1991) FUSE has sought to challenge and invigorate the language of typography. Always contained within a cardboard box, each themed issue featured written editorials from leading designers, posters and a disc with four or more fonts for personal use and exploration. This new book (within a FUSE box) from Taschen is essentially a retrospective of all the FUSE editions to date, along with additional essays, conference transcripts, and two new issues – FUSE19 and FUSE20.

Typography, like page layout, is one of those things most of us pass over without realizing its impact on communication or even understanding itself.

Definitely on my priority order list!

BTW, do note that Amazon lists as: “Fuse: Neville Brody,” Chesire’s review gives both “Fuse: From Invention to Antimatter,” and “FUSE: 1-20, From Invention to Antimatter: 20 years of FUSE,” and no doubt other variations will abound.

TEX line breaking algorithm in JavaScript

Thursday, September 1st, 2011

TEX line breaking algorithm in JavaScript by Bram Stein.

From the post:

This is an implementation of the Knuth and Plass line breaking algorithm using JavaScript and the HTML5 canvas element. The goal of this implementation is to optimally set justified text in the new HTML5 canvas element, and ultimately provide a library for various line breaking algorithms in JavaScript.

This is very impressive and will reassure your topic map clients that you pay attention to details. Work remains to be done here and elsewhere on browser displays.

This was forwarded to me by Sam Hunting.

TeXBlog – Typography with TeX and LaTeX

Monday, February 28th, 2011

TeXBlog – Typography with TeX and LaTeX

I mention this blog for several reasons.

TeX and LaTeX would benefit from the production of a topic map that eased users from less capable systems to a more full featured publication system.

To that extent, this blog would be an excellent starting place for gathering resources for such an effort.

Most of the major academic houses require the use of TeX or LaTeX for publications so if you want to publish about topic maps, that knowledge is a presumed starting point.

Knowledge of TeX and LaTeX will give you an example of how a well designed system can prosper and grow over time. Something to aspire to.