
Multimedia Knowledge Representation and Management : "Brain Annex"

 (Updated Nov. 2022)

Wouldn't it be fantastic to have a "butler" to help us as we constantly face drowning in information?

That need was crushingly pressing for me, as a polymath with a thirst for knowledge in several fields, not to mention numerous very technical jobs over the years, several complex research projects, old notes from college and grad school, an endless stream of online courses I take, a tech startup I founded and used to run, the many conferences I attend, life in general, and even hobbies that tend to generate abundant information (such as flying airplanes and studying multiple foreign languages!)  

I was immensely eager for some sort of powerful assistance, something so helpful that I could poetically describe it as an "annex" to my brain.

In this blog entry, I'll describe how deep frustration with existing software tools led to the start of the open-source BrainAnnex.org project, a web-based knowledge representation and management application.

The Problem

Knowledge comes in many forms.  Notes written out in words.  Tabular data.  Semi-structured data.  Images.  Plots.  Documents.  Audio.  Video.  Meta-data about other bits of knowledge.  Relationships between different notes....

Word processors are great for typesetting your novel, but of mediocre help for structured or semi-structured data, such as the many little ideas about tidbits to include in the novel.  And they are unwieldy for managing images, never mind other types of knowledge.

Likewise, spreadsheets are as sleek as a penguin in water for the specific tasks they were designed for, but as clumsy as a penguin wobbling on land if used in other ways!

And traditional relational databases tend to require more expertise than casual users have; that's not a problem for me personally - in fact I worked as a database administrator for years - but databases can be rather unwieldy.

We've all been there, in our personal data management and at work: we end up generating a lot of files - in particular Word docs, spreadsheets and images - and resort to using the file system to try to "tame" them in some fashion.  A battle easily lost, as evidenced by how many files typically end up littering the computer desktop.

Also, the OS file system is most definitely not meant to be a content management system.  In particular, the tree structure forces you to choose one - and one only - location for a file or folder.  If you want multiple parents (categories) for a file or folder, you need to resort to creating very unwieldy "shortcuts" (or their counterparts in various operating systems), which are generally "second-class citizens"; for example, in Windows, shortcut folders are listed alphabetically after all the "regular" folders.
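To make the "one parent only" limitation concrete, here is a minimal Python sketch of the alternative: a structure in which an item may sit in any number of categories at once, all on an equal footing.  The class name, method names and sample file names are hypothetical, made up purely for illustration; this is not how Brain Annex itself is implemented.

```python
from collections import defaultdict


class CategoryGraph:
    """Toy many-to-many mapping between items and categories (illustrative only)."""

    def __init__(self):
        self.parents = defaultdict(set)   # item -> all categories that contain it
        self.children = defaultdict(set)  # category -> all items it contains

    def link(self, item, category):
        """Place an item in a category; an item may be linked to many categories."""
        self.parents[item].add(category)
        self.children[category].add(item)

    def categories_of(self, item):
        """Return every category the item belongs to, sorted alphabetically."""
        return sorted(self.parents.get(item, set()))

    def items_in(self, category):
        """Return every item in the category, sorted alphabetically."""
        return sorted(self.children.get(category, set()))


graph = CategoryGraph()
graph.link("flight_log.pdf", "Hobbies/Flying")
graph.link("flight_log.pdf", "Documents/Logs")   # a second, equally valid parent
graph.link("checklist.txt", "Hobbies/Flying")

print(graph.categories_of("flight_log.pdf"))
print(graph.items_in("Hobbies/Flying"))
```

Unlike a file-system shortcut, neither of the two categories of flight_log.pdf is the "real" location here: both links are stored in exactly the same way.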

Beyond Content Management Systems

Content Management Systems are not a new idea.  But they are often specialized; for example, I worked with - and also developed - some Laboratory Information Management Systems (LIMS).

Many of the more general-purpose content management systems aren't particularly popular - for good reasons, in my opinion.  I will compare and contrast some in future blog entries, but for now I'll just say that my jaw dropped when I tried out Microsoft OneNote!  Not particularly useful - other than for laughs - and so no surprise that it never really took off, even with MS behind it...

Then, in 2015, after years of dissatisfaction with existing knowledge-management tools, I took the software platform that I had developed over the previous decade for a fairly elaborate movie information management system, and decided to apply it towards creating a general knowledge representation and management tool, to be eventually released as open source.

That's how the BrainAnnex.org project was born.

What is Brain Annex?

 Brain Annex is a web-based tool (which can also run locally) for Knowledge and Media Management.  It's very integrative, bringing in several open-source tools, and it's highly extendable with plugins.

There is no point in repeating here what's said on the official BrainAnnex.org site, but I'll just mention one blurb:
Freely mix up formatted text notes, documents, images, graphics, data tables, spreadsheets, video, music – synergistically used together with the power of Semantic Technology. For example, to manage:
  • class notes
  • work/research notes
  • notes from hobbies
  • notes from everyday life (bills, medical)
  • personal-growth notes (journaling, progress)
  • music collection
  • photo/video collection (personal, work or research albums; artist portfolios)
  • website bookmarks (with optional annotations)
  • documents (the contents of any uploaded PDF files become searchable)
  • new plugins for special needs can be created easily
  • optionally sharing some of your pages with others or making them public

Here's a 3-min intro video, aimed at the general public and casual users:

 

Above All, Something that is Useful

In various blog entries I'll be talking about interesting designs and technology behind knowledge representation and management, but the first-and-foremost force behind Brain Annex has been to create something extremely useful.  Remember the "butler" analogy I made at the beginning?  Brain Annex isn't meant to be a show-off for technology, or a playground for programmers; it's meant to be something of practical utility!

Since this is a blog about Science and Technology, the focus will be on the technologies behind Brain Annex - for example, Semantic Technology and Classes that have code (plugins) associated with them.  For more practical information about Brain Annex, including what it does and how to use it, I'll refer to the official BrainAnnex.org site.

I am happy to say that, without exaggeration, Brain Annex has been a life-changer for me, in terms of managing my knowledge - for my research projects, jobs, courses, hobbies, and life in general.  I've used it essentially daily for the past 6 years.  Down the line, the plan is to release it as open source (UPDATE: done, see below), and hopefully help out others.

 

Feb. 2021 UPDATE: a radically-revamped new major version of Brain Annex is currently under development, and is expected later this year.  It keeps the same general feel while overhauling and modernizing the technology internally used: PHP gives way to Python/Flask, and MySQL and a simple Triplestore give way to the graph database Neo4j.  Also, the front end is getting a major facelift with Vue.js.  Management of general datasets will be much more powerful.  The promised full release to open source will happen with this version.

Feb. 2022 UPDATE: the release of the complete redesign of Brain Annex - now in beta - took place as promised last year.  GitHub repository.

Nov. 2022 UPDATE: The overhauled new Brain Annex is explained in this short video.
