Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems.
- PDF (Wikipedia)
Bruteforce password
See PDFCrack.
Merge/Combine PDF files
- Combine PDF files (Adobe) – requires to log in
Using Python
Prerequisites
pip install pypdf4
pip --proxy http://127.0.0.1:3128 install pypdf4
merge_pdf.py
from PyPDF4 import PdfFileReader, PdfFileWriter
def merge_pdfs(paths, output):
pdf_writer = PdfFileWriter()
for path in paths:
pdf_reader = PdfFileReader(path)
for page in range(pdf_reader.getNumPages()):
# Add each page to the writer object
#pdf_writer.addPage(pdf_reader.getPage(page).rotateClockwise(180))
pdf_writer.addPage(pdf_reader.getPage(page))
# Write out the merged PDF
with open(output, 'wb') as out:
pdf_writer.write(out)
if __name__ == '__main__':
dir = 'C:/Users/somepath/'
paths = [dir + 'p1.pdf', dir + 'p2.pdf']
merge_pdfs(paths, output=dir + 'merged.pdf')