Pilot The PCCV Project: Information
Techniques (CVonline) Other stuff |
Adrian F. Clark,
Dept. ESE, University of Essex, Colchester, CO4 3SQ, UK
Pattern recognition, computer vision, and related research areas are
some of the most rapidly developing ones. And yet, despite the
ubiquity of computer technology in the research work, almost all
formal professional communication takes place via the printed page ---
an astonishing state of affairs when one considers it dispassionately.
This essay explores the possibilities of, and the potential
difficulties posed by, online publication of research results in
vision and related fields. It is principally intended to promote
discussion of this topic, something that the authors believe is long
overdue. However, the reader may care to note that much of this
discussion is quite general and may be applied to any research field.
We shall proceed by examining the purposes of communicating the fruits
of research, and then consider existing publication mechanisms and
their drawbacks. Online publication will then be considered and its
advantages and disadvantages explored. Some potential publication
mechanisms are discussed, including ways for creating a complementary
relationship with conventional publication routes. Finally, some
outstanding issues are outlined.
Before discussing the reasons for publishing, it is perhaps
appropriate to state the reasons for doing research at all. The
authors are of the opinion that any type of research is driven by the
desire to extend the corpus of human knowledge. Starting from this
point, it follows that professional communication is intended
principally to inform other researchers of one's results, in the
spirit of openness and free exchange of ideas that characterizes
scientific enquiry. It thereby avoids unnecessary duplication of work
and archives those results for future generations of researchers and
developers. In other words, publication helps to make research
efficient by allowing science to be built upon previous
science. With the pressure on us to produce research results ever
increasing --- and the funding available to do so generally decreasing
--- it is vital that this notion of efficient communication of
results continues and is developed.
Further reasons for publication have arisen over the years, largely
for validation of research work. In many countries, the numbers and
types of publication are taken as indicators of research activity and
used as ``objective'' measures in the assessment of individuals,
departments, and institutions. (Many people question the
objectiveness of such measures since they take no account of
significance; nor are publications the only measure of one's research
activities. But this is not the place for such a discussion.) There
is also the wish to receive recognition for one's own work, and to
advance the reputation of one's institution, both of which are
commendable and perfectly natural.
Let us explore this idea of extending human knowledge further
by considering the publication routes currently available to us.
Journals exist principally to disseminate research methodology and
results to other workers active in the research area. Books typically
provide overviews of the research field in a more tutorial fashion
than do papers: their coverage is normally wider but less detailed.
(There are exceptions to this, of course: many monographs summarize
research work in book form. Nevertheless, the general argument
stands.) Conversely, conferences are generally used for exploring
ideas and presenting preliminary results, their proceedings being
principally a record of people's opinions before the event. They are
usually regarded as being more ephemeral than journal or book
publications, though the increasing tendency to publish conference
proceedings in book form mitigates against this. Indeed, there are
many very widely-referenced conference publications --- and equally
many rarely-referenced journal ones.
The crucial factor of these publication routes, developed over the
years, is that of peer review: an author must be able to
convince other experts in the research area that his or her work is
worthy of publication. The standard of refereeing for journal
publications is typically quite rigourous (though, as we all know, of
variable quality) and the golden rule is that the publication should
make a contribution to the field --- this is the abovementioned
principle of extending human knowledge. Other types of paper
are also of value, of course: reviews of particular techniques or
applications, tutorials, and so on.
For books, peer review is also an intrinsic part of the writing
process, though the overriding factor here --- at least from an
author's point of view --- is the quality of the explanations.
(Publishers are, quite rightly, concerned that the book should have a
market.) Originality is less important in book production since the
intended audience is usually non-expert.
Conference publications are also subject to peer review but this is
typically less rigourous than for other types of publication. This is
partly because conferences are intended to be a place where one may
discuss ideas and results with other researchers, and so the ability
to present work in progress is advantageous. There are other reasons
too: the review panel for a conference is typically of limited size
and the deadlines required to ensure rapid decisions are such that
errors may be overlooked. The more cynical among us might even remark
that conferences exist principally to make money and consequently even
poor submissions may be accepted by some conference organizers to
increase the number of delegates --- the recent VIDEA controversy
[7] illustrates this type of problem.
There is another type of publication that must enter our
consideration, and that is the technical report. These are,
quite rightly, regarded with some caution as they have not been
subject to the peer review process; hence, their accuracy and
correctness are always suspect. Nevertheless, there are many
excellent research reports. Indeed, in many cases where both research
report and published paper exist, the research report is actually the
better work of reference because there is more detail of the
methodology and results.
The number of research reports made available is rapidly increasing;
some of the reasons for this phenomenon are discussed later. Although
this is generally a convenient state of affairs, since reports are
available quickly after a piece of work has been completed, there are
also dangers: the lack of peer review and the possibilities for bias
are obvious. More worrying in the longer term however, is the
potential for the loss of knowledge: technical reports can --- and
often do --- disappear as quickly as they appear, and when that
happens they are rapidly forgotten.
Having considered these routes for dissemination of results and their
success, let us turn our attention to the problems associated with
them.
From the above discussion, it might appear that the journal
publication route would be difficult to improve upon. This may be
true in principle, but the practice is somewhat different. Firstly,
the number of journals has increased dramatically in recent years, due
to a number of factors:
These developments, each of which seems beneficial individually, have
some unfortunate consequences when considered in toto: although
journal publication has become easier and arguably cheaper, the cost
to institutional libraries has certainly not decreased; and, since the
number of journals required to maintain a good coverage of a research
area has typically increased, many libraries are now in the position
where they are having to terminate their subscriptions to journals.
This does not benefit the researcher, who must either take out
personal subscriptions (not feasible if he or she is working alone or
as part of a small group) or find some other means of monitoring
published papers, perhaps by travelling to a neighbouring institution
or by making use of abstracting services (which also cost the
institution money, of course). It is easy to say that many of the
`upstart' journals are of indifferent quality, but this does not imply
that all papers published in them should be disregarded.
A related issue, also consequent on the increasing specialization of
researchers, is the duplication of work that is well-established in
other fields. For example, few vision researchers read the computer
graphics literature or vice versa even though the geometrical
basis for the two is identical; people working in remote sensing and
medical image processing who need to align images are not familiar
with techniques developed in electron microscopy; and so on. There
are probably two reasons for this: the relevant journals may not be
available to researchers, and they may not have time to study them in
any detail. It might be argued that this is a direct consequence of
there being too many journals; however, the authors would argue that,
with effective abstracting services, it should be easy to locate
relevant papers. (We shall return to this topic later.)
The second major problem with journal publication is that their
latency --- the delay between submission and publication --- is too
long, up to two or three years for the more prestigious
journals. Although accepting that rigourous peer review takes time, it
is difficult to argue that this is efficient professional
communication. Since many research groups work on closely-related
problems at the same time, this lag in making new ideas and results
known does us a disservice. Indeed, the increasing tendency to make
available internal research reports is partially in response to this
phenomenon.
The final point to make about journal publications has already been
touched upon above: constraints are invariably placed upon authors
regarding the lengths of papers. In order to be able to discuss
methodology and present experimental data, this results in papers that
are often difficult to read and understand. These same constraints
mean that there is rarely space for authors to describe why they have
taken the approach being described and so on. (Indeed, one of the
authors has been asked, on more than one occasion, to remove such
material from papers because it is `not relevant.') However, this
type of discussion offers insight to the reader. There is a
well-publicized case in which researchers involved in the early
development of atomic energy are being interviewed by the current
generation of researchers in order to capture just this type of
information while it is still possible: they have found that reading
research papers and reports is simply not enough.
The space constraints mentioned for journal publications normally
apply with a vengeance to conference proceedings: three or four
quarto-sized pages is a not uncommon limit. This is enough space to
present a technique in a compact form, along with one or two
(hopefully representative) results, but nothing more.
We have to ask ourselves whether these types of publication are
serving us well. To give a topical example, one of the main problems
facing vision at the moment is producing techniques that are
robust. Do publications that show only one or two
carefully-selected examples really further the discipline? The
authors would argue that they do not. Of much more value are reports
of work that present a technique, test it on a significant corpus of
data, and characterize the technique's performance in some way,
perhaps by comparison with other published techniques. It is not
necessary that a new technique perform better than a well-established
one, merely that it shows promise. (This has long been accepted in
the image coding field, where objective quality measures are in
widespread use. Perversely, coding is one of the few fields where
subjective quality measures are actually more important than objective
ones! But even subjective responses may be assessed in an objective
way.) The page limits of paper journals does not encourage this
type of analysis.
Having expounded the basis for making others aware of research and
identified the major drawbacks of existing publication routes, let us
turn our attention to the dissemination of information over computer
networks. Just as with conventional journals, there are several
classes of online communication, which we shall consider in the
following paragraphs.
The first class, familiar to almost every researcher, comprises direct
electronic mail (email) and email lists. There are several email
lists in the vision area (indeed, one of the authors moderates one).
A culture has grown whereby these lists are used principally for
distributing conference calls, announcing technical reports or
software, and so on. There is almost no technical discussion in them
--- which is something of a surprise, since that is precisely what most
of them were set up for!
Closely related to email lists are newsgroups. Although there are
many newsgroups that carry relevant and useful information, there is
only one dedicated to vision --- and that simply mirrors a mailing
list! In fact, with the opening up of the Internet, the
signal-to-noise ratio of most newsgroups is rapidly approaching zero,
and the politeness and respect for other people's opinions has already
vanished. None of the information dissemination routes in this email
and newsgroup class provide for the more formal types of communication
that are the subject of this essay.
The second class of communication route, more relevant to our
discussion, comprises two separate phenomena:
The third class --- and the current state of the art --- consists of
journals on the World-Wide Web (WWW) which offer quality output via
POSTSCRIPT and are able to accommodate mathematics, tabular
and graphical material. They offer some advantages over printed
journals too: they are searchable and are able to offer hypertext
links within papers and to cited works. Many of these journals are
organized analogously to conventional publications, having rigourous
peer review, editorial boards, and so on. A good example is
[5]
(but see also [11])
and it is worth noting
that several well-established conventional journals and profressional
societies are slowly moving towards online publication (e.g.,
[6,
1]).
This latter category is almost enough to meet the needs of the vision,
pattern recognition, and related communities --- but not quite. The
additional technology required, however, does exist, albeit in a
rather experimental form in some cases.
Images, the staple diet of vision research, are supported by all the
popular WWW browsers, both on workstations and on PC-class machines.
Both uncompressed (PBMPLUS) and compressed (lossy via JPEG or lossless
via GIF) formats are supported. Online viewing of imagery has four
major advantages over printed imagery:
A topic that is important in many areas is motion --- and this
is something that no printed journal can accommodate. (Actually,
there are image coding journals that have tried distributing videos
with issues, but this is problematic: libraries are not well-equipped
for storing or viewing them, and the degradations introduced by bulk
copying onto even U-matic videos can mask fine detail.) In the online
case, of course, replaying motion sequences comes down to having an
appropriate ``helper application'' for a WWW browser. The current
norm is to use MPEG-1 decoders; while this is not necessarily
appropriate for all applications due to the nature of the compression,
it is certainly acceptable for viewing. Indeed, with a careful choice
of viewer, one may replay sequences in slow motion or frame-by-frame.
Although vision per se is concerned principally with imagery
and information derived therefrom, the closely-related discipline of
pattern recognition need not. Research in areas such as speech
recognition --- which has made important contributions to image
analysis --- would also benefit from the ability to play back audio
data. As with motion sequences, most of the popular WWW browsers
support audio. It must be admitted that there are a number of formats
for storing audio data, but it is easy to acquire software (e.g.,
sox) that performs format inter-conversion.
There are other developments in terms of presenting data on the WWW
that will be beneficial. The most significant of these is likely to
be the Virtual Reality Modelling Language (VRML), a ``language'' for
specifying 3D environments. Since one of the most important topics in
vision is the retrieval of 3D structure (from stereo, motion, shading,
etc), a facility whereby an author may show his or her results by means
of a 3D model, one that the reader may rotate using a VRML browser,
would be invaluable. The current generation of VRML browsers display
static scenes; however, adding ``behaviours'' to VRML is under active
discussion, so it will soon be possible to produce animated 3D
displays, with obvious applications in displaying results.
The features just mentioned can be supported by an online publication
much more easily than by a conventional publication and without the
degradations caused by using paper as an intermediate medium.
Moreover, this can be extended even further if the final destination
of the research is considered to be not just the human reader but also
his or her computer. This raises the possibility of exchangeable
datasets. Although databases of images exist, it is not clear how
widely they are used and they seem to be somewhat decoupled from the
publication aspect of research. By making an explicit link between
the publication and the image data within it, the reuse of this data
is encouraged, facilitating comparative evaluation of algorithms.
What applies to data applies equally to code. Authors are discouraged
from presenting code in their papers even though this is often an easy
way of explaining the operation of an algorithm. (This has not always
been the case: consider, for example, Singleton's classic FFT paper
[8].)
If source code were available within a
publication, it would encourage readers to try out their own
experiments and build upon the work presented. Not all authors would
wish to do this of course, but at present the path is not open those
who do.
We feel that all these features --- especially quality machine
readable image data sets and code --- would be of substantial value to
the vision community.
The obvious medium for disseminating online publications is via the
WWW, as WWW browsers such as Netscape and Mosaic
already support images, audio, and video on both PC-class machines and
workstations. Since there are no `page' constraints on WWW documents,
authors are free to write arbitrarily long papers if their work
justifies it. (Of course, extraneous and over-long text will have to
be shortened, but this is a common outcome of reviewing.) Moreover,
many of the shortcomings of HTML, the mark-up scheme for WWW
documents, are being eradicated: HTML3 will be SGML-compliant, support
the concept of style sheets, and incorporate facilities for
mathematics and tabular material.
A further advantage of the WWW for `publication' is that utilities
exist to convert from the major text preparation tools used in science
and engineering (e.g., LaTeX, MS Word, WordPerfect, FrameMaker) to HTML.
This allows authors to prepare their submissions in an environment
that is familiar, if they so wish, leaving the conversion to HTML
until the point of submission. (Or perhaps even later: it might be
easier for the editors to perform the conversion, as they may wish to
use customized converters that impose the journal's style.) The
viability of this process is illustrated by this document, which was
written using LaTeX mark-up and converted to HTML via
latex2HTML.
To be successful, an online journal should not be tied to a particular
server: there should be servers in different geographical regions that
mirror each others' papers and WWW pages. (There is, of course,
software that can carry this out automatically.) As well as providing
fault tolerance, so that if one server were down, a reader need only
point his or her WWW browser to another, this scheme would ensure
researchers obtain the best response for reading or submitting papers.
Having produced papers on the WWW, it is but a small step to
contemplate recording them on CD-ROM. This might, for example, be an
appropriate way to solve the problem of distributing online journals
to libraries. (It is unlikely that this could be arranged free of
charge, and so this has potential as a source of revenue for
individual journals.) Paper copies are still required in order for
the journal to be assigned an ISSN; but the number of copies that must
be lodged is small and they are easily generated from the online
version.
Let us consider how an online journal might appear, for both a reader
and an author. The scenario described below is just one possibility,
one that the authors consider is most likely to be successful;
alternative suggestions are, of course, welcomed.
There is one item in the list that may appear strange for an online
journal:
The great advantage of this `publication' route is that the only
delays built into the system are those involved in the peer review
process. It is also entirely possible that editors may link together
papers. This might, for example, be used to trace the development of
a technique through a number of references; or it might be used to
link to further papers that agree or disagree with an author's
findings.
The key to a high quality professional journal is the peer review
system. The present system works to an extent, but poorly selected,
inexperienced and/or overworked referees mean that the process is
often less than perfect. There are many other ways of applying peer
review to maintain high quality which an online journal would
facilitate. One option is that of ``open peer commentary'' practised
in Brain and Behavioural Sciences
[4].
In this journal, papers are
accompanied by as many as twenty or thirty 1000-word invited
commentaries from experts, together with a response from the authors
of the initial paper. This allows the papers to be placed in context
and encourages fruitful discussion of issues raised. A hyperlinked
online journal without page limits would allow such a scheme to be
applied. Another possibility is to include, within the online
journal, a list of links to unrefereed papers or technical reports, to
encourage readers to keep up to date with the latest work and to
communicate with their authors.
Apart from the question of peer review, the major concern expressed by
potential authors when the idea of online publishing is suggested is
that of plagiarism, of someone copying sections from published
material in order to construct a sensible-looking document. This is,
of course, entirely possible --- but the technology that makes
plagiarism simple also aids its detection: a wily referee or editor
can easily perform searches through published online papers to locate
such transgressions. And if a note to the effect that a paper
contains plagiarized material is attached to a publication by the
linking mechanism discussed elsewhere in this document, the
repercussions to the plagiarist outweigh the potential advantages. In
fact, plagiarism works best if the work of the original author is
little known for reasons such as geography or the time lag in
publication; but if the original work is diffused immediately and
globally via the network, effective plagiarism becomes much harder.
In any case, with the widespread availability of scanners and OCR
software, the budding plagiarist already has all the required tools at
his or her disposal in respect of conventional journals.
Another vexed area is that of copyright. Most existing journals
require authors to transfer copyright to them before publication.
Many authors are unhappy with this arrangement. We need to decide
what rights need to be transferred from the author to a journal for
publication; we can then decide upon appropriate mechanisms for so
doing.
There are still a number of other unanswered questions concerning the
precise procedures to follow and the tools to use. These issues may
perhaps be addressed best by actually setting up a pilot online
journal. The authors have some concrete ideas in this respect,
which they will make public after some discussion has taken place: we
do not wish to constrain discussion by imposing them at this
juncture.
We feel that there is a definite need for our existing publication
mechanisms to evolve to accommodate the computer technology with which
we are all familiar. We feel that doing so will bring substantial
benefits to us all. It will break the space and time barriers imposed
by the traditional system by removing page limits and reducing the
time lags involved. We feel that viewing the computer as one of the
consumers of research results and methods, rather than just as an
intermediary, has the potential to make the field more collaborative
and to enable us all to make greater progress.
|
Last updated 2012-06-21 11:13:08 by Adrian F. Clark. [Accessibility information.]