I am aware that fsutil via cmd line can create file types of any particular size and format, but what I have been trying to source is test files that contain context as opposed to jibberish.
Idealistically, i would be able to access some open source test documents that are numbered 1MB, 5Mb, 10MB, etc and listed in varying formats. Otherwise to manually create these will be a bit of a pain if I am to make them precise for the proposed testing.
Any thoughts/feelings/suggestions greatly appreciated.
You could try with https://www.gutenberg.org. I cannot access the site at the moment as the whole of Germany is blocked there now but they have public domain e-books of various sizes and if I remember correctly they offered at least 4 different formats including PDF and hmm, was it MOBI, but also plain text and HTML. Iโm not sure how large you can go with the documents there but you can always concatenate some if you need.
Bumping this thread because I have this issue recently. Iโm testing that the system can accept file uploads up to a certain limit like 10MB so my approach was to do boundary testing. (9.9MB, 10MB 10.1MB, whatever that is in bytes)
That means I need to generate files of various formats like docx, pdf, png, but itโs difficult to get it very exact to the bytes. I tried to do generate a file through Powershell then change the file extension. However, the system I am testing can read the MIME type and wonโt accept a dummy file generated this way.
I did also look at the project gutenberg site posted by @maos, but I still need to scour different books to find the exact sizes I need which is time consuming.
You can always use real files as a starting point and โwriteโ just the content of the real files into the beginning of each dummy file? Have to ask why 10MB? Also, is it a web based app?
Think on whatโs the real cost to the business if it fails? Could you do this same valuable thing through another way, for example by unit/component testing?