Largest file you have ever seen (in size)?



Hey

What's the largest file (by size) you have ever encountered?

As a reminder, the following are the max sizes a file can be on some common (and not so common) file systems:

Tape for the Elektronika BK: 64 kB

FAT32: 4 GB

ext3: 2 TB

ext4: 16 TB

exFAT: 127 PB

HFS Plus: 8 EB

NTFS: 16 EB

GPFS: 512 YB

Could just generate a txt file with the same characters over and over in it to fill the largest size and then claim to be the victor of this question. Funny part is it would compress down to like 100k zipped since it's a repeating character and header info.
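To put a rough number on the compression claim, here is a minimal sketch that compresses a buffer of one repeated character with Python's `zlib` module (the same DEFLATE method commonly used in zip files). The 10 MiB buffer is an arbitrary stand-in for the giant file, and `compressed_size` is a helper name made up for this example.

```python
import zlib

def compressed_size(data: bytes, level: int = 9) -> int:
    """Return the size in bytes of `data` after zlib (DEFLATE) compression."""
    return len(zlib.compress(data, level))

# 10 MiB of a single repeated character (a small stand-in for a huge file).
repeated = b"A" * (10 * 1024 * 1024)
packed = compressed_size(repeated)
ratio = len(repeated) / packed

print(f"{len(repeated)} bytes -> {packed} bytes (about {ratio:.0f}x smaller)")
```

As far as I know, DEFLATE tops out at roughly a thousand to one, so a single compression pass would probably not get a multi-terabyte file down to 100k; nesting archives inside archives is what gets you further.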

  On 19/10/2012 at 14:04, xendrome said:

Could just generate a txt file with the same characters over and over in it to fill the largest size and then claim to be the victor of this question. Funny part is it would compress down to like 100k zipped since it's a repeating character and header info.

  On 19/10/2012 at 14:06, Buttus said:

or make a gigantic virtualbox system file...

If you want to go ahead and waste your time to just post on this thread, go ahead.

Thing is, I doubt you have enough space, drives and/or need for 512 YB.....

I've dealt with some text files that are several hundred GB in size. They're mostly collections used for research.

I'm currently working with an 800GB text file containing nothing but Tweets. My biggest problem is actually just finding somewhere to store all the data. I doubt many people have dealt with files > 3TB simply because the storage is difficult (unless you're using tapes etc.).

  On 19/10/2012 at 14:12, Pong said:

I've dealt with some text files that are several hundred GB in size. They're mostly collections used for research.

I'm currently working with an 800GB text file containing nothing but Tweets. My biggest problem is actually just finding somewhere to store all the data. I doubt many people have dealt with files > 3TB simply because the storage is difficult (unless you're using tapes etc.).

Are these just tweets, or tweets with information (person who sent it, ID of tweet, time, etc.)?

Just curious :)

  On 19/10/2012 at 14:04, xendrome said:

Could just generate a txt file with the same characters over and over in it to fill the largest size and then claim to be the victor of this question. Funny part is it would compress down to like 100k zipped since it's a repeating character and header info.

  On 19/10/2012 at 14:06, Buttus said:

or make a gigantic virtualbox system file...

Why do you guys have to make things so difficult?

"cat /dev/urandom > blob"

At least I would call it blob...
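For anyone without a shell handy, a rough Python equivalent of the one-liner above might look like the sketch below. Unlike `cat /dev/urandom > blob`, it stops at a fixed size instead of filling the disk. `write_random_blob`, the 2 MiB size and the temp-directory path are all choices made for this example only. It also checks how well the result compresses, since random data is the opposite case to a file of one repeated character.

```python
import os
import tempfile
import zlib

def write_random_blob(path: str, total_bytes: int, chunk_size: int = 1024 * 1024) -> int:
    """Write `total_bytes` of OS-supplied random data to `path`; return bytes written."""
    written = 0
    with open(path, "wb") as f:
        while written < total_bytes:
            # Never request more than what is still missing.
            chunk = os.urandom(min(chunk_size, total_bytes - written))
            f.write(chunk)
            written += len(chunk)
    return written

# Write a small 2 MiB blob into a fresh temporary directory.
blob_path = os.path.join(tempfile.mkdtemp(), "blob")
blob_size = write_random_blob(blob_path, 2 * 1024 * 1024)

# Random bytes have no patterns to exploit, so compression gains nothing.
with open(blob_path, "rb") as f:
    blob_packed = len(zlib.compress(f.read(), 9))

print(f"wrote {blob_size} bytes, compressed size {blob_packed} bytes")
```

So a random blob really does take up the space it claims, which makes it a more honest entry than the repeated-character file.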

  On 19/10/2012 at 14:42, pes2013 said:

Are these just tweets, or tweets with information (person who sent it, ID of tweet, time, etc.)?

Just curious :)

Basically, yes. I have access to the Twitter gardenhose, which spits out roughly 10% of all Tweets as they are posted.

This page lists all the details which are included with the ~34M Tweets I get to analyse every day:

https://dev.twitter.com/docs/platform-objects/tweets

  On 21/10/2012 at 12:49, pes2013 said:

I did not understand a word you said....

He's probably talking about the 42 kB zip file. I actually have it on my machine, as I'd like to test it out one day. Details:

One example of a Zip bomb is the file 42.zip which is a zip file consisting of 42 kilobytes of compressed data, containing five layers of nested zip files in sets of 16, each bottom layer archive containing a 4.3 gigabyte (4 294 967 295 bytes; ~ 3.99 GiB) file for a total of 4.5 petabytes (4 503 599 626 321 920 bytes; ~ 3.99 PiB) of uncompressed data.

Can read more about it here: http://en.wikipedia.org/wiki/Zip_bomb
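The totals quoted above can be checked with a few lines of arithmetic. This is only a sketch of the numbers as described in the quote (five layers, sets of 16, one 4 294 967 295-byte file per bottom archive); the constant names are made up for the example and nothing here touches the actual archive.

```python
LAYERS = 5                          # nesting depth from the quoted description
ARCHIVES_PER_LAYER = 16             # each archive holds 16 archives of the next layer
BOTTOM_FILE_BYTES = 4_294_967_295   # 2**32 - 1 bytes per bottom-layer file

# Number of bottom-layer archives, each containing one large file.
bottom_files = ARCHIVES_PER_LAYER ** LAYERS
total_bytes = bottom_files * BOTTOM_FILE_BYTES

print(bottom_files)   # 1048576
print(total_bytes)    # 4503599626321920
print(f"{total_bytes / 10**15:.1f} PB, {total_bytes / 2**50:.2f} PiB")
```

The byte count matches the figure in the quote, and it works out to just under 4 PiB, or about 4.5 PB in decimal units.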

  • Eric locked this topic
This topic is now closed to further replies.