Master Thesis Dev Blog #12 – Last Post

January 16th, 2011

Dear All,

I am happy to announce that I passed my colloquium and now hold a master's degree in International Media and Computing.

I am really, really, really happy that I have finished my master's and can now look forward to new challenges! It will be interesting to see what job opportunities are out there and which of them I can accept.

I would like to take the opportunity to thank all of you who accompanied, helped and supported me throughout my studies. THANK YOU!

There is not much left to say. This will be the last blog entry, and the blog will be closed in the near future.

I wish you all a wonderful time, and thanks again.

Brian

Master Thesis Dev Blog #11

January 8th, 2011

I had better not talk about the long break between my posts ;-)

But there is good news, everyone: I have already handed in my thesis, and on January 11 I will have my talk and defense. I am looking forward to this, and I am quite nervous about it, too. There are still some things to prepare, and progress is slow. Anyway, I will give an update on my colloquium next week.

Stay tuned and take care, everyone.

Brian

Master Thesis Dev Blog #10

June 15th, 2010

I have been silent for quite a while. The reason is that I have had to think a lot about my thesis lately. Some ideas and concepts I had in mind were not suitable for the thesis. Anyway, I am back on track now and have a plan for my thesis. The plan for my tagging system is set, and for a start I am now working on the web app part.

I am currently taking a look at Groovy and Grails and am impressed. Programming with this web application framework feels very similar to Ruby on Rails, with the benefit that I am more familiar with Java than with Ruby. Even better, a prototype I had created as a pure Java desktop application could be transferred to Groovy in a very short time.

My hope is that future work with this framework goes well, too. One point I have to take a closer look at is the conversion from a Groovy and Grails app to a Java enterprise app. I have no clear understanding of that at the moment and need to do some more research on it.

You may wonder why I was looking into such a framework at all. The answer is that I was planning on using a web framework anyway, as my application should be available from a central location. I knew there were nice frameworks available, like Spring, Struts and Hibernate. Furthermore, I had already worked with some of those and knew that setting them up and working with them would be hard for me. Therefore I gave Groovy and Grails a shot, as this framework builds on Spring and Hibernate anyway. So far I am very pleased with my success and with Groovy and Grails.

I will keep you posted on my progress, but for the moment I will not do weekly updates and will be more flexible instead ;-)

Master Thesis Dev Blog #9

May 18th, 2010

I finished the first versions of my functional descriptions and handed them over to my supervisors to take a look at. Although I wrote that I wanted to do some programming exercises, I wasn’t able to get to that. I worked my way through the Microsoft Office Word binary format description, only to find that it is very hard to understand and even harder to get useful information from for implementing a Word document reader. I am now looking into applications that are capable of reading different file formats, including MS Office files; otherwise I would have to write two different theses, one on my original topic and the other on the MS Office binary format ;-) At the moment I have Tika from the Apache foundation, which seems capable of reading MS Office binary formats. I will test that as a next step.

Another direction of research is the handling of the metadata I will extract from the documents. One of the books I ordered has a lot of useful information about that. RDF and OWL seem to be a kind of standard for handling metadata and then using that information for the semantic web. Reading all this material adds much more information that I have to sort out. It also matters whether this can be used with an existing system at my company.
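To make the RDF idea a bit more concrete: RDF describes a resource as a set of subject-predicate-object statements (triples). The following is only a minimal sketch of how extracted document metadata could be written down that way, not the implementation from the thesis. The document URI, the metadata values and the helper name `to_ntriples` are made up for illustration; the only real vocabulary used is the Dublin Core terms namespace.

```python
# Dublin Core terms namespace (a real, widely used metadata vocabulary).
DCTERMS = "http://purl.org/dc/terms/"


def to_ntriples(subject_uri, metadata):
    """Turn a dict of Dublin Core term -> plain string value into N-Triples lines.

    Hypothetical helper for illustration; it only handles plain string literals.
    """
    lines = []
    for term, value in sorted(metadata.items()):
        # Escape backslashes, double quotes and newlines so the literal stays valid.
        literal = (
            str(value)
            .replace("\\", "\\\\")
            .replace('"', '\\"')
            .replace("\n", "\\n")
        )
        lines.append(f'<{subject_uri}> <{DCTERMS}{term}> "{literal}" .')
    return lines


# Hypothetical document and values, invented for this example.
doc_uri = "http://example.org/docs/report-42"
extracted = {
    "title": "Quarterly Report",
    "creator": "Jane Doe",
    "format": "application/msword",
}

for line in to_ntriples(doc_uri, extracted):
    print(line)
```

Each printed line is one statement about the document. A real system would more likely use an RDF library than string formatting; the sketch only shows the shape of the data.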

Although my progress is slow at the moment, I am confident that I will end up with a good system. So far all the different pieces are fitting together quite nicely into a complete system.

Master Thesis Dev Blog #8

May 7th, 2010

This week did not go very well for me. At the beginning of the week I caught a cold with a fever, and I have a bad cough now. Therefore I was a bit limited in what I could do. But now the good news: I read some very nice articles about metadata for learning objects, which is quite similar to what I want to do. That helped me get a better understanding and a good picture of the topic.

I worked on the functional description of the tasks the system should perform. Describing these items is always a challenge, and I will need some more iterations before I am happy with the result. Finally, I was able to find documentation on how to work with Flash movies and the office documents. I will need to get the text, and if I am able to work with information associated with the text, such as font size, this can only be good for the project.

I have now ordered some more books on text/data mining and am looking forward to reading them. Otherwise I will work on the functional description and do some first programming exercises.

Master Thesis Dev Blog #7

April 30th, 2010

This week I do not have that much to write. I collected and wrote down a few pages about tagging and the content that will be scanned, and pulled together a list of all the facts that seem interesting to me and important for my thesis. There are still some articles to read, but they focus on a different area now. Whereas in the beginning I read more about tagging and its pros and cons, I am now shifting my attention to text and data mining as well as the semantic web. These topics focus more on the practical parts of my thesis. The other articles were more interesting in terms of the theory and usefulness of tagging in general. Either today or tomorrow I will also start writing the functional requirements for my application.

One thing I am wondering about is whether I read too much about tagging and whether that was useful or not. I am not sure if I should have switched to the more practical issues earlier. I will have to assess that after writing my thesis.

I am still waiting for some feedback from the Document Management Company. Otherwise I have a plan B if something goes wrong. I am looking forward to the next couple of days, as I have the feeling that by focusing more on the practical parts I will generate a lot of new information for me and for you :)

So please stay tuned :o )

Master Thesis Dev Blog #6

April 23rd, 2010

It is time for my next blog entry about my master thesis.

I am still working my way through tons of articles and books. The most recent book I got is titled “Good Tags – Bad Tags”. I skimmed through it last night and it looks really promising, as it has a good collection of interesting articles about tagging in general and tagging in companies. I will take this book apart on the weekend and hope to get a lot of useful information from it. Furthermore, I have some more articles to read which also have the potential to help me with my thesis.

I am amazed how much information you can get about tagging for a wide range of areas. This is also noted in one of the articles I read (sorry, I cannot remember which one; I will add this info when I find it again). Anyway, at the beginning of my thesis I thought there would not be much about it, as these are only tags. Digging deeper into this field of research, I realized quite fast that there is a lot to learn and much to research, from the social aspects of tagging to the technical ones.

The third item I want to write about is mind-mapping. I have already used it a few times to gather ideas about a certain topic. During the master seminar, which I have every Thursday, I was shown a new facet that I had never used before. What I usually did was structure the words that came to my mind right away and connect them. It seems to be important, though, to first write down everything that comes to your mind; the structuring can be done afterward. My professor posted some interesting links on a university web platform, which I will follow up on, as they have some more information on that topic.

Master Thesis Dev Blog #5

April 16th, 2010

I am busy reading all kinds of books and articles related to my thesis to get a good basis for the practical work itself. Besides the practical work, I have to build a strong case that my work will be useful for the employees of the company and add some value. That makes my work more interesting, as I have to work on technical issues but also on social ones.

Anyway, I have already written a few lines about two topics of my thesis. After my professor's review I know that I have to work harder on using footnotes and references, even in my early drafts. Maybe especially in my early drafts, so that I do not miss them later in my thesis.

I wrote my first entries on my wiki and printed those short texts later. Tikiwiki has solved multi-page printing quite comfortably with a feature that lets you select several wiki pages. The selected pages are combined into one HTML page, which you can print easily. It is not necessary to print each wiki page separately.

At the beginning of the week I visited the FU Berlin library to get access to the ACM Digital Library. This was really easy, and I can only recommend visiting the FU Berlin to everyone who needs scientific documents. At least for students it is really easy to get access. If you are well prepared, it does not take a lot of time to get all the papers: you can search the ACM library at home before you visit the FU Berlin library. Most importantly, some of the ACM library papers are really good and worth the extra effort of going to the FU library to get them.

Master Thesis Dev Blog #4

April 11th, 2010

I have already started to write a few things up for my thesis. Topics covered so far are tagging and its beginnings, and document management systems. It is not much yet, but it is a good start. My schedule for the week is also settled and better than I thought.

Next I will play a bit with Feng Office and Gantt charts. I found a promising plugin to generate Gantt charts from tasks in Feng Office. I hope I can get it installed, as this would make my life a bit easier. On Tuesday I plan to visit a few more libraries to get more books and articles. Furthermore, I will finish up a use case diagram and schedule a meeting with my colleagues to get some feedback on it.

Update: I did take a look at adding Gantt charts to Feng Office, but it looks like too much work at the moment and not worth it for my purposes. Just for this one project I will live with a separate Gantt chart. But it is good to know that it is possible. Maybe if there is more time in the future…

Master Thesis Dev Blog #3

April 7th, 2010

Again, I am late with my blog entry. The new semester has now started and I hope to get things straight soon. Last week I pulled together a list of books and articles I want to read for my thesis. I visited one of the libraries and learned an important lesson: do not trust that you will get the books you want, even if they were available an hour ago. Unfortunately that happened with three books. Anyway, I have found two other interesting books so far that I will start reading. Furthermore, I will visit other libraries and get some more books. In addition to that, I have to fix my schedule, project phases, and use case diagram.