Robert Spalek [Fri, 25 Jun 2004 11:28:40 +0000 (11:28 +0000)]
modifications done at home; MJ's objections not yet incorporated:
- added buck2obj_{alloc,free}()
- extract_odes() renamed to buck2obj_convert() and its interface simplified:
it now flushes the memory pool and calls obj_new(), as used e.g. in scanner.c
- attribute lengths are stored incremented by one, so that zero lengths are allowed
- the length of the compressed part is stored as a U32 instead of UTF8, to allow
zero-copy compression
- the temporary use of object.c's oa_allocate removed; we now call
obj_add_attr_ref() instead of obj_add_attr()
- renamed constants BUCKET_TYPE_*
- defined internal macro RET_ERR()
Robert Spalek [Thu, 24 Jun 2004 12:29:44 +0000 (12:29 +0000)]
sighandler.c:
- used sigaction() instead of signal()
- no need to re-register the signal handler now :-)
- renamed my_sighandler_t to sh_sighandler_t and changed the interface
Robert Spalek [Thu, 24 Jun 2004 11:57:48 +0000 (11:57 +0000)]
- lizard_alloc() turns on the wrapper for SIGSEGV and lizard_free() restores
its original value
- lizard_decompress_safe() quickly registers the SIGSEGV handler using the
wrapper, saving 2 syscalls
- allocate 3 more bytes for unaligned memory access
Robert Spalek [Wed, 23 Jun 2004 16:48:25 +0000 (16:48 +0000)]
MJ's idea:
- only lock the memory by mprotect() once
- decompress into the middle of the buffer, so that the barrier stays at the
same distance as before
(needs one more pointer in the structure)
Robert Spalek [Wed, 23 Jun 2004 14:41:42 +0000 (14:41 +0000)]
incorporated MJ's suggestions:
- flush_copy_command() exploits fast unaligned memory access and memcpy()
lizard_compress():
- the test in_start==copy_start replaced by flag bof, in_start deleted
- if (copy_len > 0) replaced by if (copy_len)
- pos_bit |= 1<<4
- deleted the test for cropping at BOF; it is obsolete now
lizard_decompress():
- at label perform_copy_command, we set expect_copy_command=2
- exploit fast unaligned memory access
Robert Spalek [Tue, 15 Jun 2004 09:21:39 +0000 (09:21 +0000)]
crash tests changed:
- since the memory protection is removed after decompress_safe(), it makes
no sense to check the read/write protection on the corresponding page
===> removed
- now, if the returned value is < 0, print errno
- try allocating a buffer that is too small,
OR setting the expected length too low
Robert Spalek [Tue, 15 Jun 2004 09:16:13 +0000 (09:16 +0000)]
- low-level safe version of lizard_decompress() put into an extra source file
- use M_PRIVATE instead of M_SHARED
- use PROT_NONE instead of PROT_READ, and only set/clear it for one page
before/after the operation instead of doing it for the whole array
- errno is set instead of returning different negative values
- use longjmp in the signal handler instead of die() and return -1
- use macro ALIGN()
Robert Spalek [Mon, 14 Jun 2004 10:12:19 +0000 (10:12 +0000)]
sped up approximately 6 times:
- the whole idea of 2 hash-tables (for 3- and 4-byte matches) was bad
- the collision link-lists containing erroneous entries were also bad
===> greatly simplified: only one hash-table/hash-function/link-list/... for
3-byte matches; a double-linked list that can be maintained in constant time
while preserving correctness; links to strings made implicit (hence the data
structure is half the size and fits better into the CPU cache); no arithmetic
when computing the hash function; tuned the constants determining the
compression level; commented out the code for 2-byte matches; ...
Robert Spalek [Mon, 14 Jun 2004 09:58:37 +0000 (09:58 +0000)]
debugged, now it is fully functional:
- fixed a lot of typos (especially C operator-precedence mistakes and
variable-name mismatches)
- fixed bit-format errors (forgotten additive constants or negations)
- do not use hash_rec[0]
- wrong entries in the collision link-lists must NOT appear at the beginning
==> saved time when verifying and resolved some strange cases
- changed constants determining the maximum prolong-factor
- added a simple test-tool
Robert Spalek [Mon, 19 Apr 2004 16:41:45 +0000 (16:41 +0000)]
- 0x08 (BACKSPACE) is now treated as a blank character and accepted as an ASCII character
- 0x7f is also accepted as an ASCII character
- both gather/content.c and gather/charset.c now use the same function
Cblank() to test it
Martin Mares [Sun, 18 Apr 2004 13:39:32 +0000 (13:39 +0000)]
Changed locking rules. Scans and appends can peacefully co-exist now.
Should solve the problems with shep-reap waiting for bucket file transmission
to finish.
Martin Mares [Sat, 10 Apr 2004 20:36:01 +0000 (20:36 +0000)]
Multi-part objects (with header and body separated by an empty line and terminated
either by EOF or by a NUL byte) are very common, so let's introduce a special
function for reading them.
Martin Mares [Thu, 8 Apr 2004 22:18:19 +0000 (22:18 +0000)]
More enhancement to the main loop library: Export all lists for easy inspection
(reading only) by the callers. When a process exits, construct a nice tombstone
string for it.
Martin Mares [Wed, 7 Apr 2004 22:03:30 +0000 (22:03 +0000)]
Added a universal main loop with timers, file descriptor polling and process
watching. Inspired by the glib main loop, but this one has a much nicer
interface.
It will be used in the Shepherd master and if it turns out to be useful,
I'll convert the other programs to use it some day.
Martin Mares [Sun, 14 Mar 2004 12:58:40 +0000 (12:58 +0000)]
Our regex functions are now able to interface to old-style BSD re_match(),
to POSIX regexec() and to libpcre. Currently it's switched to the BSD mode
as before, I'll look at it more in the evening.
Martin Mares [Tue, 2 Mar 2004 15:38:20 +0000 (15:38 +0000)]
When we try to create a temporary file and it already exists (which can happen
if a program with the same PID crashed at some time in the past), don't
panic; just overwrite the file. This should be safe, since we're using our own
tmp directory, which nobody else can access.
Martin Mares [Sat, 28 Feb 2004 10:49:48 +0000 (10:49 +0000)]
Hopefully finally sorted out the "http://www.xyz.cz?param" mess. The true
semantics turned out to be "http://www.xyz.cz/?param" and most web servers
really require "GET /?param".
I've changed the normalization rules to add the leading slash if needed
which also solves the relative URL problem I mentioned in the comments.
However, this means that the SEMANTICS OF NORMALIZED URLs HAVE CHANGED
and gatherer databases with URLs in the "http://www.xyz.cz?param" form
are now INVALID. I'm going to delete all such URLs from our gatherer now.
Martin Mares [Tue, 24 Feb 2004 18:36:23 +0000 (18:36 +0000)]
Blank lines are considered separators, not terminators of buckets.
Hence extraneous blank lines between buckets and trailing blank lines
after the last bucket are all ignored.