6237cc2a
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/07/19 20:33
add pdf2txt rc script
b574ea6c
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/07/19 20:33
Significantly improved text output
8a7f9b4b
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/07/17 22:15
[parsing] fix unexpected report for n
30b1e2c8
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/03 16:00
[ops] add Tj
ee8ac152
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/03 02:27
ascii85: support z, validate each character, do NOT change the input buffer
c5a08187
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/03 02:22
merge AGAIN???
009e8c6a
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 12:10
merge
14a77353
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 12:07
convert image streams to plan 9 Memimage
2d3fd973
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 12:01
xref: missed a newline on error printing
4b5ef581
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/02 11:16
Merge
12d4d90f
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/02 11:15
add page number finder
fd70297e
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 09:23
ccittfax: use malloc since memmoving directly after
743c4502
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 09:11
stream: close the filter after use
ab56ad8a
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 08:31
todo: jp2 CMYK
21df0468
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/06/02 07:42
mkfile: add "deps" rule to install 3rd party stuff
ccb4d1dc
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 21:54
ignore BDC / EMC marked-content operators
5a668f4a
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 21:20
fix empty page handling
464b83a8
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 19:30
buffered pagerender
6d2612e1
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 19:30
don't print object when using " for page
cbd76101
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 18:30
only use stdout
a65b2474
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 18:14
improve pdf2txt heuristics
7dd87721
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 14:57
add preliminary heuristic-based text generation
b0515397
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 13:16
[array] split out arrayadd
ee088488
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 13:07
add opfind
b3610440
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/06/01 13:06
comment out unimplemented stubs for now
9d76340e
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/05/31 19:59
commit latest op.c changes
daf3484d
– Noam Preil <noam@pixelhero.dev>
authored
on 2021/05/31 11:51
Typo fix
f342f5a5
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/19 16:13
nop static funcs for ops
724f34cd
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/19 13:04
add type4 function ops
0edbea43
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/18 18:20
todo: Extends in ObjStm
51e620a8
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/16 12:51
ops: BX/EX
dd33a154
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/10 18:03
some op stubs
c626c599
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/10 12:57
add TODO to the readme
6405cc7d
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/10 12:42
RunLengthDecode: forgot to inc
6d76ea1f
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/10 12:41
RunLengthDecode
e74f2002
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 11:11
jbig2: extract as a full jbig2 file, honor JBIG2Globals
6c254ed2
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 11:10
ccitfax: static
0adf9d27
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 10:29
ccittfax: default width is 1728
3ca07537
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 10:20
dct, jpx: extract as is
f001da49
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 09:21
ccittfax: bps is always 1
a6c9e642
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 09:18
extract ccitt fax as tiff
0cba24e7
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2021/04/09 09:11
jbig2: copy input to output as is
77b04087
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/12/09 09:21
add * to list the keys of a dict
a04da7cf
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/25 17:30
mkfile: define BIN (thanks qwx)
5e27bf4b
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 18:44
update readme
98f900cd
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 18:32
command line: evaluate @NUMBER arguments as ref objects
6cb9961b
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 18:31
jpx filter: plain copy for now
01b0f541
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 17:17
remove TODO
21c3f8f2
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 15:19
mkfile: remove duplicate from OFILES
ef5d7bc1
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 15:18
lzw and flate filters: use common predict logic
7c3dae6e
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 14:43
ascii85 filter: call bufput once per 4 bytes
f33615cb
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 14:30
mkfile: add pdf.h to HFILES
3d14c5f6
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 14:29
lzw filter
d1a98f52
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 14:29
ascii85 filter: drain the input buffer
d40e02af
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/22 14:28
buffer: remove "eof" field
8c5ae53b
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 18:40
remove unneeded code
76bd9fac
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 15:26
ascii85 filter
75ceb243
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 15:25
flate filter: ignore trailing garbage
da441c02
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 15:25
object: print the position of the unexpected char
52ffbd2f
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 15:24
stream: wrong filters order, fix it; add Sobjoffset
a8650067
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 15:24
change the command line array indexing
a105fe96
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/21 07:56
keep filters in their own files
ac1954db
– Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/11/20 18:12
sort out integers and store the top of the document, not just the root
7c2e7951
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 17:06
leave dctd (jpeg) as is for now, ie just do a plain copy in the filter
ad1850d0
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 17:01
print the error when resolved object is null
c962ff8e
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 17:01
pdfobj: another return early error
93f2d56c
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 17:00
number can be negative
33ddc667
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 17:00
pdfobj: return error right away when pdfdict fails
e991b566
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 16:08
stream: fix objects with no filters
2cfd1426
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 16:08
dump to stdout as is
2ff084d1
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/09/01 12:19
fix tons of bugs, use proper streaming
474117ed
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 12:27
move xref logic into a separate file
17128cef
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 08:18
add license
ad28ff35
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 08:16
rename to pdffs
02fc81c3
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 07:50
pdfeval: handle nil case better
d613b76a
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 07:50
add array helpers
b4e6b3b8
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/31 07:49
fix flate-encoded streams with PNG prediction; parse compressed xref streams
ecd40a88
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/30 13:39
move stuff around and just use Biobuf* everywhere
745debcd
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 19:21
main: print filename on error
1e0a3529
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 19:16
remove stream/filter traces
df075d40
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 19:15
make pdfeval update the pointer; add a dumb ref counter to objects
1d93500d
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 18:46
better api (less Pdf *pdf); eval more often; use null
a9516693
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 12:29
add and use flate filter
ef6cdd0d
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 07:17
rename some of the functions
51cd3bfc
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/29 00:44
attach dicts to streams if there is one
a080ae88
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/28 22:45
rewrite the API, support more object types and actual evaluation
34238e0f
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/28 14:02
fix xref parsing and add pdfeval to resolve indirect objects
3c27f041
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/28 05:16
remove a todo
d9638664
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/27 20:43
add more object types, parse file trailer
f8f7ffe6
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/27 13:32
add more stuff
73b21f1b
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/27 07:00
pdfstring: octal chars
97218f13
– Sigrid Haflínudóttir <ftrvxmtrx@gmail.com>
authored
on 2020/08/20 12:47
just put it out