From abcffe8e31f1044cfc40ab52697f0b4165467dee Mon Sep 17 00:00:00 2001
From: Yuchen Pei
Date: Thu, 19 Apr 2018 11:38:52 +0200
Subject: draft of update on open research

---
 site/blog.html | 46 +++++++++++++++++++++++++++++++++++++++-------
 1 file changed, 39 insertions(+), 7 deletions(-)

diff --git a/site/blog.html b/site/blog.html
index 4bfe026..c7f9340 100644
--- a/site/blog.html
+++ b/site/blog.html
@@ -18,6 +18,45 @@
+

Updates on open research

+

Posted on 2018-04-13

+

It has been 8 months since I last wrote about open research. Since then two things happened which prompted me to write an update.

+

First, I read about Richard Stallman, the champion of the free software movement, in his biography by Sam Williams and in his own collection of essays, Free Software, Free Society. For anyone interested in open research, I highly recommend having a look at these two books.

+

Second, three weeks ago I attended a workshop titled Open research: rethinking scientific collaboration. That was the first time I met a group of people who also want open research to happen, and we had some stimulating discussions.

+

From both of these I have developed some ideas.

+

Freedom and community

+

This section is restricted to mathematical research / academia.

+

Ideals matter. My understanding is that Stallman’s struggles stemmed from the frustration of having requests for source code denied (a frustration I have shared in academia, with maths knowledge in place of source code), and revolved around the two things that underlie the free software movement: freedom and community. That is, the freedom to use, modify and share a work and, by sharing, to help the community.

+

Likewise, for open research, apart from the utilitarian view that it is more efficient and makes credit theft harder, we should not ignore the ethical aspect: open research is right and fair.

+

In particular, I think freedom and community can also serve as principles in open research. One way to make this argument more concrete is to describe the problems of NDAs (non-disclosure agreements) and reproducibility.

+

NDAs. It is assumed that when establishing a collaboration, or during a discussion, the joint work in progress belongs to all those involved, and no one has the freedom to disclose any intermediate results without getting permission from all collaborators. In effect this amounts to signing an NDA. Considering that the primary goal of academia is to advance human knowledge rather than to make a profit, NDAs in research are unacceptable. They also restrict people’s freedom to share information that can benefit their own or others’ research.

+

Reproducibility. Published papers are not necessarily reproducible, even when they have appeared in peer-reviewed journals. This is because the peer-review process is opaque and the proofs in the papers may not be clear to everyone. To make things worse, there are no open channels to discuss results in these papers, and one may have to rely on interacting with the small circle of the informed. One example is folk theorems. Another is trade secrets required to decipher published works.

+

The NDA problem concerns restrictions on the freedom of those holding information to disclose it, so it does not cover trade secrets in ongoing research. That is, on the other side of the coin, it is also within one’s freedom to withhold such information, even though it is not nice to do so when the information could help others with their research.

+

But we do need a community that promotes and respects the free flow of maths knowledge, in the spirit of the four essential freedoms: a community that rejects NDAs and upholds reproducibility.

+

Here are some ideas on how to tackle these two problems and build the community:

+
+
  1. Free licensing. It solves the NDA problem: free licenses permit redistribution and modification of works, so if you adopt them in your joint work, then you have the freedom to modify and distribute the work. It also helps with reproducibility: if a paper is not clear, anyone can write their own version and publish it. Bonus points for using copyleft licenses like Creative Commons Share-Alike or the GNU Free Documentation License.
+
  2. A forum for discussions of mathematics. It helps solve the reproducibility problem: public interaction may help quickly clarify problems. By the way, Math Overflow is not a forum.
+
  3. An infrastructure of mathematical knowledge. Like the GNU system, this would be a mathematics encyclopedia under a copyleft license, maintained GitHub-style rather than Wikipedia-style by a “Free Mathematics Foundation”, and drawing contributions from the public (inside or outside academia). To begin with, crowd-source (again, GitHub-style) the proofs of, say, 1000 foundational theorems covered in the curriculum of a bachelor’s degree. Perhaps start by taking contributions from people with some credentials (e.g. a bachelor’s degree in maths) and then extend contribution permission to the public.
+
  4. Cite with care: if a work is considered authoritative but you couldn’t reproduce the results, whereas another paper which tries to explain or discuss similar results makes the first paper understandable to you, give both papers due attribution (something like: see [1], but I couldn’t reproduce the proof in [1], and the proofs in [2] helped clarify it). No one should be offended if you say you cannot reproduce something, since there may be causes on both sides, whereas citing [2] is fairer and helps readers with a similar background.
+
+

Tools for open research

+

The open research workshop revolved around how to lead scientific academia towards open research. There were discussions on open research tools, improving credit attribution, the peer-review process and the path to adoption.

+

From the workshop I learned about some interesting tools for open research that are either new or just new to me, and it is exciting to see them.

+
+
  • OSF, an online research platform. Clean and simple interface with commenting, wiki, citation generation, DOI generation, tags, license generation etc. Like GitHub it supports private and public repositories (but defaults to private) and version control, with the ability to fork or bookmark a project.
+
  • SciPost, physics journals whose peer review reports and responses are public (peer-witnessed refereeing), and which allow comments (post-publication evaluation). Like arXiv, it requires an academic credential (PhD or above) to register.
+
  • Knowen, a platform to organise knowledge in directed acyclic graphs. Could be useful for building the infrastructure of mathematical knowledge.
+
  • Fermat’s Library, the journal club website that crowd-annotates one notable paper per week, has released a Chrome extension, Librarian, that overlays a commenting interface on arXiv.
+
  • The Polymath project, the famous massive collaborative mathematical project. It is not exactly new, but it is the only open maths research project that has gained some traction and recognition. However, it does not have many projects (currently only one active project).
+
  • The Stacks Project, quite close to my vision of an open source infrastructure.
+
+

An anecdote from the workshop

+

In a conversation during the workshop, one of the participants called open science “normal science”, because reproducibility, open access, collaboration and fair attribution are all part of what science is supposed to be, and practices like treating readers as buyers rather than users should be called “bad science” rather than “closed science”.

+

To which an organiser replied: maybe we should rename the workshop “Not-bad science”.

+ +
+

The Mathematical Bazaar

Posted on 2017-08-07

In this essay I describe some problems in mathematical academia and propose an open source model, which I call open research in mathematics.

@@ -116,13 +155,6 @@

Copyright notice: This review is published at http://www.ams.org/mathscinet-getitem?mr=3306078, its copyright owned by the AMS.

-
-
-

On a causal quantum double product integral related to Lévy stochastic area.

-

Posted on 2015-07-01

-

In this paper with Robin we study the family of causal double product integrals \[ \prod_{a < x < y < b}\left(1 + i{\lambda \over 2}(dP_x dQ_y - dQ_x dP_y) + i {\mu \over 2}(dP_x dP_y + dQ_x dQ_y)\right) \]

-

where P and Q are the mutually noncommuting momentum and position Brownian motions of quantum stochastic calculus. The evaluation is motivated heuristically by approximating the continuous double product by a discrete product in which infinitesimals are replaced by finite increments. The latter is in turn approximated by the second quantisation of a discrete double product of rotation-like operators in different planes due to a result in (Hudson-Pei2015). The main problem solved in this paper is the explicit evaluation of the continuum limit W of the latter, and showing that W is a unitary operator. The kernel of W is written in terms of Bessel functions, and the evaluation is achieved by working on a lattice path model and enumerating linear extensions of related partial orderings, where the enumeration turns out to be heavily related to Dyck paths and generalisations of Catalan numbers.

-
-- cgit v1.2.3