OpenOffice ODF/.odt compared to Microsoft Word .doc

Overview

This is the first in a series of articles that will compare ODF and in particular the OpenOffice implementation and Microsoft Office and its various data formats with respect to various measures. This article will cover the efficiency of the .odt, .doc and .xml formats, with particular interest to native and compressible file sizes.

Methodology

My windows test cases were generated using the following software:

  • Microsoft Windows XP Professional 2002, SP2
  • Microsoft Word 2003 (11.6368.6368) SP2
  • OpenOffice 2.0.3
  • Adobe Acrobat Standard 7.0.8 5/16/2006.

My Linux test cases where produced with the following software:

  • SuSE Linux 10.1
  • OpenOffice 2.0.2.7.1
  • Adobe Reader 7.0.8 05/22/2006

I needed a fairly large chunk of text for my test, I decided on the November draft of the ISO/IEC C Standard, located at http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2005/n1905.pdf (copy here). This is a significantly large document, and I decided only to use the first seven chapters for my test case. In order to produce the target documents, I selected the contents from the beginning of the document through chapter 7, and copied this to the clipboard. I then pasted the clipboard into native versions of Microsoft Word under Windows and OpenOffice Writer under both Windows and Linux. For Microsoft Word, I saved the document as a native .doc and .xml. For OpenOffice, I saved the document as native .odt and exported it as .doc. I also saved the content as .txt with Notepad under Windows as a reference point. For archival purposes, I have mirrored all documents referred to in this article on the 3monkey wiki download area.

Raw Results

File Size
Microsoft Office .doc 921,088
Microsoft Office .xml 6,475,669
OpenOffice (XP) .odt 154,892
OpenOffice (XP) .doc 1,335,296
OpenOffice (Linux) .odt 160,045
OpenOffice (Linux) .doc 1,338,368
Notepad 417,549

Observations

My first observation was the Linux OpenOffice implementation created slightly larger file sizes than the Windows implementation. This was probably due to the differing versions. I will revisit this in a later article if it is merited.

My next observation was that the OpenOffice .doc file was significantly larger than the Microsoft Word version. This is likely due to Microsoft’s access to the complete .doc specification, and thus a better understanding of how to optimize the file content and size. For grins, I loaded the OpenOffice .doc with Microsoft Word and saved it naively. I also loaded the Microsoft Word .doc with OpenOffice and saved it both as a .doc and .odt. The results of these test are below.

File Size
OO .doc loaded/saved in MS 808,960
MS .doc loaded/saved in OO 1,277,952
MS .doc loaded/saved as .odt in OO 155,113

This produced some interesting results. First, even though the original OpenOffice .doc file was originally larger than the native Microsoft Word version, when loaded and saved with Word, resulted in a file 12% smaller file than the original native Word .doc. This indicates that OpenOffice does not save all of the information regarding a document that Word does. This is further supported by the opposite transformation. When we load the Word document in OpenOffice and re-save as a .doc, we experience a file size reduction again. This reduction, although not as significant, clearly supports the fact that OpenOffice is not saving all the information in its .doc format as Word. By a cursory visual inspection all of the documents seem to be equivalent. Without access to the .doc file format specification it is difficult to infer whether or not the information loss is of consequence or not. In other words the file size difference may be due to bloat in the native Word format or due information loss by OpenOffice.Next most people will notice that not only is the .odt versions smaller than then .doc versions irregardless of which application produced them. Further more the .odt is almost one-third the size of the raw text from notepad. The reason the .odt is so significantly smaller is the the OpenOffice implementation applies compression on its output, and obviously decompresses it on the fly for input. This has both as advantages and disadvantages. The primary disadvantage is load and save times. Since the file must be either compressed or decompressed, this takes extra CPU cycles. However, with the speed and efficiency of today’s processors, this should be of little practical impact. The one obvious major advantage is file size. Not only does this save in raw disk storage, but also results in lower bandwidth for such mediums as email and downloads.I wondered what would the results be of compressing the .doc, .xml, .odt and .txt? I compressed all four formats using the Linux utility zip (as that is the underlining implementation for OpenOffice). The results (below) where fairly interesting and somewhat expected.

File Type Original Size Compressed Size
.doc 921,088 179,648
.xml 6,475,669 228,497
.odt 154,892 153,456
.txt 417,549 104,236

Notice that each format compress roughly to the same size. The .xml is larger due to both its original size thus the number of segments that needed to be compressed and additional data compared to the other formats. The .doc is roughly 15% larger than the .odt, which was only slightly compressed (perhaps to a slight algorithm change). The .txt compressed more than the others, this is due to the fact that it cares on formatting, style or meta information and is simply the raw text. Seeing the vastly decreased storage in respect to the .doc, I wonder why Microsoft does not inappropriate a compression strategy similar to OpenOffice.

Conclusion

From this limited data sample, I have to declare OpenOffice Writer the champion of round one. Perhaps if Microsoft Word employed a compressed output form the outcome may have been different. It is actually a little strange that OpenOffice which is based on a pure text format (XML) is compressed into a binary zip file and that Microsoft Word, which is a proprietary binary format is not.

What Is Up Next?

For the most part these test cases did not contain much formatting or style information, nor did it consider such elements as tables and graphs. I will investigate how these effect the efficiency in a latter article. But before I do that, I will need to expose more of how ODF works. Therefore, the next few articles in this series will be a primer for the ODF specification.

Until next time…
-3Monkeys

Popularity: 100% [?]

  • DZone
  • StumbleUpon
  • Technorati
  • del.icio.us
  • Slashdot
  • Digg
  • Reddit
  • NewsVine
  • SphereIt
  • e-mail
  • Facebook
  • Google Bookmarks
  • Live
  • Propeller
1 Star2 Stars3 Stars4 Stars5 Stars6 Stars7 Stars8 Stars9 Stars10 Stars (35 votes, average: 6.57 out of 10)
Loading ... Loading ...

61 Responses to “OpenOffice ODF/.odt compared to Microsoft Word .doc”

  1. meneame.net Says:

    OpenOffice odt versus Microsoft doc…

    Comparativa en ingles entre los formatos ODT y DOC….

  2. 3monkeys » OpenOffice: .odt Opened Up Says:

    [...] In the first article in this series, OpenOffice ODF/.odt compared to Microsoft Word .doc, I compared various file types for size efficiency. Of particular interest was the fact that OpenOffice Write stores .odts in a zip format, an implementation of PKZip to be exact. With this knowledge and the Open Document Format standard, we can investigate how certain elements of a document effect its size and overall efficiency. [...]

  3. Heliologue Says:

    In all fairness, you should probably be comparing ODF against Microsoft’s new XML format, which is less apples-to-oranges.

  4. Woooops Says:

    Which is better shouldn’t be judged by size.

  5. Hildegard Jasper Says:

    Krall…

    Useful, thank you!…

  6. Kevin Says:

    I think you meant “regardless” in the third paragraph under “Observations.” Irregardless means “with regard.”

  7. OpenOffice ODT Microsoft DOC ile kar??la?t?r?lm?? | Etixet Says:

    [...] Payla?mam gerekti?ini dü?ünüyorum. Ama telif hakk? meselesi nedeniyle özür dileyerek ingilizcesini veriyorum. Tablolar? bile inceleyip anlaman?z yeterli (S?ras?yla Dosya Tipleri ve [...]

  8. Jake Says:

    Kevin: I cannot find a single dictionary that says irregardless means with regard. My print dictionary says see: Regardless. Meriam Webster online says “nonstandard : regardless”. All of these definitions: http://dictionary.reference.com/browse/irregardless say that is regardless.

    It is nonstandard prehaps, but it does oddly mean regardless nonetheless.

  9. Jenrose Says:

    I discovered this with a file that saved to 300k or thereabouts in ODT, and almost 900 in TXT.

    Mind boggling.

  10. JoeG Says:

    The origin of irregardless is not known for certain, but the consensus among references is that it is a blend of irrespective and regardless, both of which are commonly accepted standard English words. By blending these words, an illogical word is created. “Since the prefix ir- means ‘not’ (as it does with irrespective), and the suffix -less means ‘without,’ irregardless is a double negative.”[1]

    thats from wiki.
    Basically irregardless is a bastardization of the english language
    derived from a lazy crunch of two words when one isnt sure which to use.
    and according to my father lol, its like nails on a chalkboard.

  11. Sean Says:

    LOL! That comment about irregardless reminds me of when my wife used to say “could you refrain from not doing that?” So I comply. I keep right on doing it. It used to really get her mad just because I listened to her! She hasn’t done that in a long time so I guess she got the hint.

  12. Jeff Says:

    Thanks for the information!

    I too would not necessarily say that OpenOffice wins round one. Compressing the documents is obviously not a priority for Microsoft and I think for good reason. I use OpenOffice almost exclusively but I have notice significantly longer load times. I did not understand why until I read your article. If anyone is emailing either doc or odt they’re idiots. Fortunately, OO makes it very easy to generate a pdf! Now, is there a way to tell OO not to compress the odt? I would guess not.

  13. Joan Thoby Says:

    Decent text. Like your blog design as well. continue your good work.

  14. Barbara Loans Says:

    Hey, awesome website. I came across this on Yahoo, and I am stoked that I did. I will definately be returning here more often. Wish I could add to the conversation and bring a bit more to the post, but am just absorbing as much info as I can at the moment.

  15. Jake Rhymes Says:

    Nice post. The information presented here was the best I could find all day lengthy, and I have been searching hard on the Internet. I believe you ought to put this up on a large social bookmarking site, you will discover that it spreads like wildfire – Cheers – dave

  16. Mixer Shower Says:

    ~,; that seems to be a great topic, i really love it `*:

  17. EnvisionStar Says:

    Great post! You should host your site at EnvisionStar.com, you can signup with the for just ONE CENT by clicking the “Specials” link at the top of their website: http://www.envisionstar.com/aff/4244

  18. Sheldon Dalin Says:

    Good to become browsing an individual’s blog page when a great deal more, it has become several weeks in my opinion. Nicely the submit in which i’ve become continued to wait for the purpose of which means that very long. I’d like to see the write-up to complete this task inside of these university, not to mention it’s exact subject together with your write-up. Thanks, awesome discuss..

  19. gf12 Says:

    I would like to voice my gratitude for your kind-heartedness for all those that must have assistance with this particular topic. Your real commitment to getting the solution all through ended up being surprisingly informative and has frequently enabled regular people like me to reach their targets. Your important guide entails much a person like me and additionally to my office colleagues. Thanks a ton; from everyone of us.

  20. iphone Says:

    Good post. I be taught one thing tougher on different blogs everyday. It can at all times be stimulating to learn content material from different writers and observe a little bit one thing from their store. I’d favor to make use of some with the content material on my weblog whether or not you don’t mind. Natually I’ll provide you with a hyperlink on your internet blog. Thanks for sharing.

  21. Domain Says:

    There are some fascinating cut-off dates on this article however I don’t know if I see all of them middle to heart. There is some validity but I’ll take hold opinion till I look into it further. Good article , thanks and we would like extra! Added to FeedBurner as properly

  22. Jenifer Poulter Says:

    Fantastic website. A lot of useful info here. I’m sending it to some friends ans also sharing in delicious. And of course, thanks for your sweat!

  23. Call India Cheap Says:

    I am glad to be a visitant of this unadulterated web blog, thank you for this rare info!

  24. Call India cheap from Canada Says:

    Nice blog right here! Also your website so much up fast! What web host are you the use of? Can I am getting your associate link to your host? I wish my site loaded up as fast as yours lol.

  25. fungal nail treatment Says:

    Spot on with this write-up, I really assume this website wants much more consideration. I’ll probably be again to learn way more, thanks for that info.

  26. Manual Traffic Exchange Says:

    I’ve read a few good stuff here. Certainly price bookmarking for revisiting. I surprise how a lot attempt you put to make one of these magnificent informative site.

  27. Florencia Hupman Says:

    higest paying surveys online http://tinyurl.com/bu9rs9u

  28. bestsellers Says:

    There are some attention-grabbing cut-off dates in this article but I don’t know if I see all of them middle to heart. There may be some validity however I’ll take hold opinion until I look into it further. Good article , thanks and we would like extra! Added to FeedBurner as properly

  29. Deloris Brewen Says:

    I’ve recently started a website, the info you offer on this web site has helped me tremendously. Thanks for all of your time & work.

  30. Online Calculator Says:

    I have recently started a website, the info you offer on this website has helped me tremendously. Thanks for all of your time & work.

  31. Scientific Calculator Says:

    I would like to thank you for the efforts you’ve put in writing this web site. I am hoping the same high-grade web site post from you in the upcoming as well. Actually your creative writing skills has encouraged me to get my own website now. Actually the blogging is spreading its wings fast. Your write up is a good example of it.

  32. IP address find Says:

    Great – I should certainly pronounce, impressed with your web site. I had no trouble navigating through all the tabs and related info ended up being truly simple to do to access. I recently found what I hoped for before you know it in the least. Quite unusual. Is likely to appreciate it for those who add forums or anything, web site theme . a tones way for your client to communicate. Excellent task.

  33. wimax Says:

    Thanks for sharing superb informations. Your web-site is very cool. I’m impressed by the details that you have on this site. It reveals how nicely you understand this subject. Bookmarked this web page, will come back for more articles. You, my friend, ROCK! I found just the info I already searched all over the place and simply couldn’t come across. What a great website.

  34. Currency converter easy Says:

    I’m no longer positive the place you are getting your info, but great topic. I needs to spend some time finding out much more or understanding more. Thanks for wonderful information I used to be on the lookout for this information for my mission.

  35. Currency converter online Says:

    I have to show thanks to you just for bailing me out of this type of matter. After surfing around throughout the world-wide-web and obtaining ideas which are not productive, I figured my life was well over. Living without the solutions to the difficulties you have sorted out through the short post is a crucial case, and those that could have adversely damaged my career if I had not noticed your web site. Your own personal talents and kindness in touching all the things was tremendous. I don’t know what I would have done if I hadn’t discovered such a subject like this. It’s possible to at this time look ahead to my future. Thank you so much for your expert and effective guide. I won’t hesitate to propose your web blog to any person who will need counselling about this area.

  36. Ethernet cable splitter Says:

    Over again heavy read-up thanks lots for sharing, keep me posted I will be reading more of your read-ups in the approaching!

  37. rv repair Says:

    Good story once again. I am looking forward for your next post:)

  38. Total Web Traffic Says:

    magnificent points altogether, you simply received a new reader. What could you recommend about your publish that you just made a few days ago? Any certain?

  39. Madelyn Teichert Says:

    Simply want to say your article is as surprising. The clearness in your post is just spectacular and i could assume you’re an expert on this subject. Well with your permission let me to grab your feed to keep updated with forthcoming post. Thanks a million and please carry on the rewarding work.

  40. ??????? ??????? Says:

    Thank you for sharing excellent informations. Your web site is very cool. I am impressed by the details that you have on this web site. It reveals how nicely you understand this subject. Bookmarked this website page, will come back for extra articles. You, my friend, ROCK! I found just the info I already searched everywhere and simply couldn’t come across. What a perfect web-site.

  41. ??????? ???????? ? ??????? Says:

    I’ve read several just right stuff here. Definitely worth bookmarking for revisiting. I surprise how much effort you place to make the sort of magnificent informative web site.

  42. Movie T-Shirts Says:

    I and also my friends were reading the nice information and facts on your website while all of a sudden I got a horrible suspicion I had not expressed respect to the web blog owner for those techniques. Those young men are already absolutely glad to read all of them and already have clearly been enjoying those things. Appreciate your indeed being very helpful and also for picking such fabulous topics most people are really needing to discover. Our own honest apologies for not expressing appreciation to you earlier.

  43. Political T-Shirts Says:

    Thanks a bunch for sharing this with all people you really realize what you are speaking approximately! Bookmarked. Kindly additionally visit my website =). We may have a hyperlink alternate agreement between us!

  44. scanmyface Says:

    My brother suggested I would possibly like this blog. He was once totally right. This submit truly made my day. You can not believe simply how so much time I had spent for this info! Thanks!

  45. Plagiarism Checker Free Says:

    Nice post. I be taught something more challenging on totally different blogs everyday. It would always be stimulating to learn content from other writers and apply a little one thing from their store. I’d want to make use of some with the content on my blog whether you don’t mind. Natually I’ll provide you with a link on your internet blog. Thanks for sharing.

  46. ignace ontario hotels Says:

    I have to express my appreciation to you for bailing me out of such a scenario. As a result of looking through the internet and coming across suggestions that were not powerful, I assumed my life was gone. Existing without the presence of approaches to the difficulties you’ve fixed by way of your article content is a crucial case, and the ones which could have negatively damaged my entire career if I hadn’t come across your web page. Your good talents and kindness in controlling the whole lot was useful. I don’t know what I would have done if I hadn’t discovered such a subject like this. I am able to at this time look forward to my future. Thanks for your time very much for the professional and result oriented help. I will not be reluctant to suggest your web blog to any person who wants and needs support on this area.

  47. Open Office Says:

    Remember when we used to be nostalgic?

  48. computer repair mississauga Says:

    I haven’t checked in here for some time as I thought it was getting boring, but the last several posts are good quality so I guess I’ll add you back to my daily bloglist. You deserve it my friend :)

  49. Idell Amenta Says:

    Lovely, congratulation – Bruuuummmm

  50. hairdresser Says:

    There are some interesting closing dates on this article however I don’t know if I see all of them heart to heart. There is some validity but I’ll take maintain opinion until I look into it further. Good article , thanks and we want more! Added to FeedBurner as effectively

  51. Birth Certificates Says:

    I do not even know how I ended up right here, but I believed this publish was great. I do not understand who you’re however certainly you’re going to a famous blogger if you happen to aren’t already ;) Cheers!

  52. Jess Gough Says:

    I think other web-site proprietors should take this site as an model, very clean and excellent user genial style and design, let alone the content. You’re an expert in this topic!

  53. booty Says:

    Thanks for some other informative web site. The place else may I get that kind of information written in such a perfect approach? I have a mission that I am simply now operating on, and I have been at the look out for such info.

  54. 09391340 Says:

    I am glad to be a visitant of this stark weblog, thanks for this rare information!

  55. testujemy22 Says:

    Lovely, congratulation – Bruuuummmm

  56. Atlanta Chiropractor DOT Says:

    Thanks a lot for providing individuals with an extremely marvellous chance to read in detail from this blog. It’s usually so good and packed with a great time for me personally and my office colleagues to visit the blog at the least three times per week to find out the new secrets you have. And lastly, I’m just always pleased for the amazing tricks you give. Some 2 areas in this post are unquestionably the simplest I have ever had.

  57. Stone Mountain Chiropractor Says:

    My wife and i felt quite cheerful that Peter managed to complete his survey via the precious recommendations he grabbed in your web site. It’s not at all simplistic to just happen to be giving out tips that many others might have been selling. And we all discover we need the writer to appreciate for that. The type of illustrations you made, the simple web site menu, the relationships your site aid to promote – it’s got everything fabulous, and it is making our son in addition to us understand the theme is cool, and that is rather fundamental. Thanks for all the pieces!

  58. Rolando Rye Says:

    Mind-boggling Brad Pitt. He eventually fell for the actual attraction of Social Media. No, he isn’t on Twitter. Or is he on Facebook. His primary endeavor on the web is on Google +. Also, Brad Pitt has an unrestrainable giggling attack on the set of Moneyball. At least, he defintely won’t be fighting against himself in the best picture battle. Perhaps you’ve seen Oscar nominated Moneyball? The story of Oakland A’s gm Billy Beane’s prosperous attempt to put together a baseball team on a tight budget by utilizing computer-generated evaluation to be able to draft his players, starring Brad Pitt and Jonah Hill, directed by Bennett Miller. Instant download from Amazon http://goo.gl/Zwr7p

  59. referencement Says:

    Thank you for any other informative site. The place else may just I get that kind of info written in such an ideal approach? I’ve a mission that I am just now working on, and I’ve been on the look out for such information.

  60. Metaldetecting Says:

    Spot on with this write-up, I really think this web site needs way more consideration. I’ll most likely be again to read way more, thanks for that info.

  61. Tores Maris Says:

    Excellent weblog here! Additionally your site quite a bit up fast! What web host are you the usage of? Can I get your associate hyperlink to your host? I want my web site loaded up as fast as yours lol

Leave a Reply