shithub: pdffs

6237cc2a – Noam Preil <noam@pixelhero.dev> authored on 2021/07/19 20:33
add pdf2txt rc script

b574ea6c – Noam Preil <noam@pixelhero.dev> authored on 2021/07/19 20:33
Significantly improved text output

8a7f9b4b – Noam Preil <noam@pixelhero.dev> authored on 2021/07/17 22:15
[parsing] fix unexpected report for n

30b1e2c8 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/03 16:00
[ops] add Tj

ee8ac152 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/03 02:27
ascii85: support z, validate each character, do NOT change the input buffer

c5a08187 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/03 02:22
merge AGAIN???

009e8c6a – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 12:10
merge

14a77353 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 12:07
convert image streams to plan 9 Memimage

2d3fd973 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 12:01
xref: missed a newline on error printing

4b5ef581 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/02 11:16
Merge

12d4d90f – Noam Preil <noam@pixelhero.dev> authored on 2021/06/02 11:15
add page number finder

fd70297e – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 09:23
ccittfax: use malloc since memmoving directly after

743c4502 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 09:11
stream: close the filter after use

ab56ad8a – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 08:31
todo: jp2 CMYK

21df0468 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/06/02 07:42
mkfile: add "deps" rule to install 3rd party stuff

ccb4d1dc – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 21:54
ignore BDC / EMC marked-content operators

5a668f4a – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 21:20
fix empty page handling

464b83a8 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 19:30
buffered pagerender

6d2612e1 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 19:30
don't print object when using " for page

cbd76101 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 18:30
only use stdout

a65b2474 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 18:14
improve pdf2txt heuristics

7dd87721 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 14:57
add preliminary heuristic-based text generation

b0515397 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 13:16
[array] split out arrayadd

ee088488 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 13:07
add opfind

b3610440 – Noam Preil <noam@pixelhero.dev> authored on 2021/06/01 13:06
comment out unimplemented stubs for now

9d76340e – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/05/31 19:59
commit latest op.c changes

daf3484d – Noam Preil <noam@pixelhero.dev> authored on 2021/05/31 11:51
Typo fix

f342f5a5 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/19 16:13
nop static funcs for ops

724f34cd – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/19 13:04
add type4 function ops

0edbea43 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/18 18:20
todo: Extends in ObjStm

51e620a8 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/16 12:51
ops: BX/EX

dd33a154 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/10 18:03
some op stubs

c626c599 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/10 12:57
add TODO to the readme

6405cc7d – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/10 12:42
RunLengthDecode: forgot to inc

6d76ea1f – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/10 12:41
RunLengthDecode

e74f2002 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 11:11
jbig2: extract as a full jbig2 file, honor JBIG2Globals

6c254ed2 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 11:10
ccittfax: static

0adf9d27 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 10:29
ccittfax: default width is 1728

3ca07537 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 10:20
dct, jpx: extract as is

f001da49 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 09:21
ccittfax: bps is always 1

a6c9e642 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 09:18
extract ccitt fax as tiff

0cba24e7 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2021/04/09 09:11
jbig2: copy input to output as is

77b04087 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/12/09 09:21
add * to list the keys of a dict

a04da7cf – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/25 17:30
mkfile: define BIN (thanks qwx)

5e27bf4b – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 18:44
update readme

98f900cd – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 18:32
command line: evaluate @NUMBER arguments as ref objects

6cb9961b – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 18:31
jpx filter: plain copy for now

01b0f541 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 17:17
remove TODO

21c3f8f2 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 15:19
mkfile: remove duplicate from OFILES

ef5d7bc1 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 15:18
lzw and flate filters: use common predict logic

7c3dae6e – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 14:43
ascii85 filter: call bufput once per 4 bytes

f33615cb – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 14:30
mkfile: add pdf.h to HFILES

3d14c5f6 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 14:29
lzw filter

d1a98f52 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 14:29
ascii85 filter: drain the input buffer

d40e02af – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/22 14:28
buffer: remove "eof" field

8c5ae53b – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 18:40
remove unneeded code

76bd9fac – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 15:26
ascii85 filter

75ceb243 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 15:25
flate filter: ignore trailing garbage

da441c02 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 15:25
object: print the position of the unexpected char

52ffbd2f – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 15:24
stream: wrong filters order, fix it; add Sobjoffset

a8650067 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 15:24
change the command line array indexing

a105fe96 – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/21 07:56
keep filters in their own files

ac1954db – Sigrid Solveig Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/11/20 18:12
sort out integers and store the top of the document, not just the root

7c2e7951 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 17:06
leave dctd (jpeg) as is for now, ie just do a plain copy in the filter

ad1850d0 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 17:01
print the error when resolved object is null

c962ff8e – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 17:01
pdfobj: another return early error

93f2d56c – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 17:00
number can be negative

33ddc667 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 17:00
pdfobj: return error right away when pdfdict fails

e991b566 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 16:08
stream: fix objects with no filters

2cfd1426 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 16:08
dump to stdout as is

2ff084d1 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/09/01 12:19
fix tons of bugs, use proper streaming

474117ed – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 12:27
move xref logic into a separate file

17128cef – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 08:18
add license

ad28ff35 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 08:16
rename to pdffs

02fc81c3 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 07:50
pdfeval: handle nil case better

d613b76a – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 07:50
add array helpers

b4e6b3b8 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/31 07:49
fix flate-encoded streams with PNG prediction; parse compressed xref streams

ecd40a88 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/30 13:39
move stuff around and just use Biobuf* everywhere

745debcd – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 19:21
main: print filename on error

1e0a3529 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 19:16
remove stream/filter traces

df075d40 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 19:15
make pdfeval update the pointer; add a dumb ref counter to objects

1d93500d – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 18:46
better api (less Pdf *pdf); eval more often; use null

a9516693 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 12:29
add and use flate filter

ef6cdd0d – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 07:17
rename some of the functions

51cd3bfc – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/29 00:44
attach dicts to streams if there is one

a080ae88 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/28 22:45
rewrite the API, support more object types and actual evaluation

34238e0f – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/28 14:02
fix xref parsing and add pdfeval to resolve indirect objects

3c27f041 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/28 05:16
remove a todo

d9638664 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/27 20:43
add more object types, parse file trailer

f8f7ffe6 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/27 13:32
add more stuff

73b21f1b – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/27 07:00
pdfstring: octal chars

97218f13 – Sigrid Haflínudóttir <ftrvxmtrx@gmail.com> authored on 2020/08/20 12:47
just put it out