diff options
Diffstat (limited to 'microposts')
40 files changed, 0 insertions, 376 deletions
diff --git a/microposts/2048-mdp.md b/microposts/2048-mdp.md deleted file mode 100644 index cee7bbf..0000000 --- a/microposts/2048-mdp.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2018-05-25 ---- -[This post](http://jdlm.info/articles/2018/03/18/markov-decision-process-2048.html) models 2048 as an MDP and solves it using policy iteration and backward induction. diff --git a/microposts/ats.md b/microposts/ats.md deleted file mode 100644 index 798ecae..0000000 --- a/microposts/ats.md +++ /dev/null @@ -1,7 +0,0 @@ ---- -date: 2018-05-22 ---- - -> ATS (Applied Type System) is a programming language designed to unify programming with formal specification. ATS has support for combining theorem proving with practical programming through the use of advanced type systems. A past version of The Computer Language Benchmarks Game has demonstrated that the performance of ATS is comparable to that of the C and C++ programming languages. By using theorem proving and strict type checking, the compiler can detect and prove that its implemented functions are not susceptible to bugs such as division by zero, memory leaks, buffer overflow, and other forms of memory corruption by verifying pointer arithmetic and reference counting before the program compiles. Additionally, by using the integrated theorem-proving system of ATS (ATS/LF), the programmer may make use of static constructs that are intertwined with the operative code to prove that a function attains its specification. - -[Wikipedia entry on ATS](https://en.wikipedia.org/wiki/ATS_(programming_language)) diff --git a/microposts/bostoncalling.md b/microposts/bostoncalling.md deleted file mode 100644 index 32a5265..0000000 --- a/microposts/bostoncalling.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-05-20 ---- - -(5-second fame) I sent a picture of my kitchen sink to BBC and got mentioned in the [latest Boston Calling episode](https://www.bbc.co.uk/programmes/w3cswg8c) (listen at 25:54). diff --git a/microposts/boyer-moore.md b/microposts/boyer-moore.md deleted file mode 100644 index e3e0f9c..0000000 --- a/microposts/boyer-moore.md +++ /dev/null @@ -1,17 +0,0 @@ ---- -date: 2018-06-04 ---- - -The [Boyer-Moore algorithm for finding the majority of a sequence of elements](https://en.wikipedia.org/wiki/Boyer–Moore_majority_vote_algorithm) falls in the category of "very clever algorithms". - - int majorityElement(vector<int>& xs) { - int count = 0; - int maj = xs[0]; - for (auto x : xs) { - if (x == maj) count++; - else if (count == 0) maj = x; - else count--; - } - return maj; - } - diff --git a/microposts/catalan-overflow.md b/microposts/catalan-overflow.md deleted file mode 100644 index 8f26145..0000000 --- a/microposts/catalan-overflow.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-06-11 ---- - -To compute Catalan numbers without unnecessary overflow, use the recurrence formula $C_n = {4 n - 2 \over n + 1} C_{n - 1}$. diff --git a/microposts/colah-blog.md b/microposts/colah-blog.md deleted file mode 100644 index 10b9df4..0000000 --- a/microposts/colah-blog.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2018-05-18 ---- -[colah's blog](https://colah.github.io/) has a cool feature that allows you to comment on any paragraph of a blog post. Here's an [example](https://colah.github.io/posts/2015-08-Understanding-LSTMs/). If it is doable on a static site hosted on Github pages, I suppose it shouldn't be too hard to implement. This also seems to work more seamlessly than [Fermat's Library](https://fermatslibrary.com/), because the latter has to embed pdfs in webpages. Now fantasy time: imagine that one day arXiv shows html versions of papers (through author uploading or conversion from TeX) with this feature. diff --git a/microposts/coursera-basic-income.md b/microposts/coursera-basic-income.md deleted file mode 100644 index 7ae4770..0000000 --- a/microposts/coursera-basic-income.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-06-20 ---- - -Coursera is having [a Teach-Out on Basic Income](https://www.coursera.org/learn/exploring-basic-income-in-a-changing-economy). diff --git a/microposts/darknet-diaries.md b/microposts/darknet-diaries.md deleted file mode 100644 index 1e0b945..0000000 --- a/microposts/darknet-diaries.md +++ /dev/null @@ -1,6 +0,0 @@ ---- -date: 2018-08-13 ---- - -[Darknet Diaries](https://darknetdiaries.com) is a cool podcast. -According to its about page it covers "true stories from the dark side of the Internet. Stories about hackers, defenders, threats, malware, botnets, breaches, and privacy." diff --git a/microposts/decss-haiku.md b/microposts/decss-haiku.md deleted file mode 100644 index e7a98d5..0000000 --- a/microposts/decss-haiku.md +++ /dev/null @@ -1,58 +0,0 @@ ---- -date: 2019-03-16 ---- ->``` ->Muse! When we learned to ->count, little did we know all ->the things we could do -> ->some day by shuffling ->those numbers: Pythagoras ->said "All is number" -> ->long before he saw ->computers and their effects, ->or what they could do -> ->by computation, ->naive and mechanical ->fast arithmetic. -> ->It changed the world, it ->changed our consciousness and lives ->to have such fast math -> ->available to ->us and anyone who cared ->to learn programming. -> ->Now help me, Muse, for ->I wish to tell a piece of ->controversial math, -> ->for which the lawyers ->of DVD CCA ->don't forbear to sue: -> ->that they alone should ->know or have the right to teach ->these skills and these rules. -> ->(Do they understand ->the content, or is it just ->the effects they see?) -> ->And all mathematics ->is full of stories (just read ->Eric Temple Bell); -> ->and CSS is ->no exception to this rule. ->Sing, Muse, decryption -> ->once secret, as all ->knowledge, once unknown: how to ->decrypt DVDs. ->``` - -Seth Schoen, [DeCSS haiku](https://en.wikipedia.org/wiki/DeCSS_haiku) diff --git a/microposts/defense-stallman.md b/microposts/defense-stallman.md deleted file mode 100644 index 1ed1def..0000000 --- a/microposts/defense-stallman.md +++ /dev/null @@ -1,7 +0,0 @@ ---- -date: 2019-09-30 ---- - -Someone wrote a bold article titled ["In Defense of Richard Stallman"](https://geoff.greer.fm/2019/09/30/in-defense-of-richard-stallman/). Kudos to him. - -Also, an interesting read: [Famous public figure in tech suffers the consequences for asshole-ish behavior](https://cfenollosa.com/blog/famous-computer-public-figure-suffers-the-consequences-for-asshole-ish-behavior.html). diff --git a/microposts/fsf-membership.md b/microposts/fsf-membership.md deleted file mode 100644 index a468300..0000000 --- a/microposts/fsf-membership.md +++ /dev/null @@ -1,16 +0,0 @@ ---- -date: 2020-08-02 ---- - -I am a proud associate member of Free Software Freedom. For me the -philosophy of Free Software is about ensuring the enrichment of a -digital commons, so that knowledge and information are not -concentrated in the hands of selected privileged people and locked up -as "intellectual property". The genius of copyleft licenses like GNU -(A)GPL ensures software released for the public, remains public. Open -source does not care about that. - -If you also care about the public good, the hacker ethics, or the -spirit of the web, please take a moment to consider joining FSF as an -associate member. It comes with [numerous perks and -benefits](https://www.fsf.org/associate/benefits).
\ No newline at end of file diff --git a/microposts/gavin-belson.md b/microposts/gavin-belson.md deleted file mode 100644 index 33e8395..0000000 --- a/microposts/gavin-belson.md +++ /dev/null @@ -1,9 +0,0 @@ ---- -date: 2018-12-11 ---- - -> I don’t know about you people, but I don’t want to live in a world where someone else makes the world a better place better than we do. - -Gavin Belson, Silicon Valley S2E1. - -I came across this quote in [a Slate post about Facebook](https://slate.com/business/2018/12/facebook-emails-lawsuit-embarrassing-mark-zuckerberg.html) diff --git a/microposts/google-search-not-ai.md b/microposts/google-search-not-ai.md deleted file mode 100644 index 4bff7c2..0000000 --- a/microposts/google-search-not-ai.md +++ /dev/null @@ -1,6 +0,0 @@ ---- -date: 2018-04-30 ---- -> But, users have learned to accommodate to Google not the other way around. We know what kinds of things we can type into Google and what we can’t and we keep our searches to things that Google is likely to help with. We know we are looking for texts and not answers to start a conversation with an entity that knows what we really need to talk about. People learn from conversation and Google can’t have one. It can pretend to have one using Siri but really those conversations tend to get tiresome when you are past asking about where to eat. - -Roger Schank - [Fraudulent claims made by IBM about Watson and AI](http://www.rogerschank.com/fraudulent-claims-made-by-IBM-about-Watson-and-AI)
\ No newline at end of file diff --git a/microposts/hacker-ethics.md b/microposts/hacker-ethics.md deleted file mode 100644 index c8d77d4..0000000 --- a/microposts/hacker-ethics.md +++ /dev/null @@ -1,11 +0,0 @@ ---- -date: 2018-04-06 ---- -> - Access to computers—and anything that might teach you something about the way the world works—should be unlimited and total. Always yield to the Hands-On Imperative! -> - All information should be free. -> - Mistrust Authority—Promote Decentralization. -> - Hackers should be judged by their hacking, not bogus criteria such as degrees, age, race, or position. -> - You can create art and beauty on a computer. -> - Computers can change your life for the better. - -[The Hacker Ethic](https://en.wikipedia.org/wiki/Hacker_ethic), [Hackers: Heroes of Computer Revolution](https://en.wikipedia.org/wiki/Hackers:_Heroes_of_the_Computer_Revolution), by Steven Levy diff --git a/microposts/hackers-excerpt.md b/microposts/hackers-excerpt.md deleted file mode 100644 index 2076578..0000000 --- a/microposts/hackers-excerpt.md +++ /dev/null @@ -1,9 +0,0 @@ ---- -date: 2018-06-15 ---- - -> But as more nontechnical people bought computers, the things that impressed hackers were not as essential. While the programs themselves had to maintain a certain standard of quality, it was quite possible that the most exacting standards—those applied by a hacker who wanted to add one more feature, or wouldn’t let go of a project until it was demonstrably faster than anything else around—were probably counterproductive. What seemed more important was marketing. There were plenty of brilliant programs which no one knew about. Sometimes hackers would write programs and put them in the public domain, give them away as easily as John Harris had lent his early copy of Jawbreaker to the guys at the Fresno computer store. But rarely would people ask for public domain programs by name: they wanted the ones they saw advertised and discussed in magazines, demonstrated in computer stores. It was not so important to have amazingly clever algorithms. Users would put up with more commonplace ones. -> -> The Hacker Ethic, of course, held that every program should be as good as you could make it (or better), infinitely flexible, admired for its brilliance of concept and execution, and designed to extend the user’s powers. Selling computer programs like toothpaste was heresy. But it was happening. Consider the prescription for success offered by one of a panel of high-tech venture capitalists, gathered at a 1982 software show: “I can summarize what it takes in three words: marketing, marketing, marketing.” When computers are sold like toasters, programs will be sold like toothpaste. The Hacker Ethic notwithstanding. - -[Hackers: Heroes of Computer Revolution](http://www.stevenlevy.com/index.php/books/hackers), by Steven Levy. diff --git a/microposts/how-can-you-help-ia.md b/microposts/how-can-you-help-ia.md deleted file mode 100644 index 88b56f6..0000000 --- a/microposts/how-can-you-help-ia.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2020-06-21 ---- - -[How can you help the Internet Archive?](https://blog.archive.org/2020/06/14/how-can-you-help-the-internet-archive/) Use it. It's more than the Wayback Machine. And get involved. diff --git a/microposts/how-to-learn-on-your-own.md b/microposts/how-to-learn-on-your-own.md deleted file mode 100644 index 758bf0f..0000000 --- a/microposts/how-to-learn-on-your-own.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-05-30 ---- - -Roger Grosse's post [How to learn on your own (2015)](https://metacademy.org/roadmaps/rgrosse/learn_on_your_own) is an excellent modern guide on how to learn and research technical stuff (especially machine learning and maths) on one's own. diff --git a/microposts/ia-lawsuit.md b/microposts/ia-lawsuit.md deleted file mode 100644 index 9c77920..0000000 --- a/microposts/ia-lawsuit.md +++ /dev/null @@ -1,9 +0,0 @@ ---- -date: 2020-08-02 ---- - -The four big publishers Hachette, HarperCollins, Wiley, and Penguin Random House are still pursuing Internet Archive. - -> [Their] lawsuit does not stop at seeking to end the practice of Controlled Digital Lending. These publishers call for the destruction of the 1.5 million digital books that Internet Archive makes available to our patrons. This form of digital book burning is unprecedented and unfairly disadvantages people with print disabilities. For the blind, ebooks are a lifeline, yet less than one in ten exists in accessible formats. Since 2010, Internet Archive has made our lending library available to the blind and print disabled community, in addition to sighted users. If the publishers are successful with their lawsuit, more than a million of those books would be deleted from the Internet’s digital shelves forever. - -[Libraries lend books, and must continue to lend books: Internet Archive responds to publishers’ lawsuit](https://blog.archive.org/2020/07/29/internet-archive-responds-to-publishers-lawsuit/)
\ No newline at end of file diff --git a/microposts/learning-knowledge-graph-reddit-journal-club.md b/microposts/learning-knowledge-graph-reddit-journal-club.md deleted file mode 100644 index eba3f82..0000000 --- a/microposts/learning-knowledge-graph-reddit-journal-club.md +++ /dev/null @@ -1,15 +0,0 @@ ---- -date: 2018-05-07 ---- -### Learning via knowledge graph and reddit journal clubs -It is a natural idea to look for ways to learn things like going through a skill tree in a computer RPG. - -For example I made a [DAG for juggling](https://ypei.me/posts/2015-04-02-juggling-skill-tree.html). - -Websites like [Knowen](https://knowen.org) and [Metacademy](https://metacademy.org) explore this idea with added flavour of open collaboration. - -The design of Metacademy looks quite promising. It also has a nice tagline: "your package manager for knowledge". - -There are so so many tools to assist learning / research / knowledge sharing today, and we should keep experimenting, in the hope that eventually one of them will scale. - -On another note, I often complain about the lack of a place to discuss math research online, but today I found on Reddit some journal clubs on machine learning: [1](https://www.reddit.com/r/MachineLearning/comments/8aluhs/d_machine_learning_wayr_what_are_you_reading_week/), [2](https://www.reddit.com/r/MachineLearning/comments/8elmd8/d_anyone_having_trouble_reading_a_particular/). If only we had this for maths. On the other hand r/math does have some interesting recurring threads as well: [Everything about X](https://www.reddit.com/r/math/wiki/everythingaboutx) and [What Are You Working On?](https://www.reddit.com/r/math/search?q=what+are+you+working+on?+author:automoderator+&sort=new&restrict_sr=on&t=all). Hopefully these threads can last for years to come.
\ No newline at end of file diff --git a/microposts/learning-undecidable.md b/microposts/learning-undecidable.md deleted file mode 100644 index 34e5e27..0000000 --- a/microposts/learning-undecidable.md +++ /dev/null @@ -1,29 +0,0 @@ ---- -date: 2019-01-27 ---- - -My take on the [Nature paper _Learning can be undecidable_](https://www.nature.com/articles/s42256-018-0002-3): - -Fantastic article, very clearly written. - -So it reduces a kind of learninability called estimating the maximum (EMX) to the cardinality of real numbers which is undecidable. - -When it comes to the relation between EMX and the rest of machine learning framework, the article mentions that EMX belongs to "extensions of PAC learnability include Vapnik’s statistical learning setting and the equivalent general learning setting by Shalev-Shwartz and colleagues" (I have no idea what these two things are), but it does not say whether EMX is representative of or reduces to common learning tasks. So it is not clear whether its undecidability applies to ML at large. - -Another condition to the main theorem is the union bounded closure assumption. It seems a reasonable property of a family of sets, but then again I wonder how that translates to learning. - -The article says "By now, we know of quite a few independence [from mathematical axioms] results, mostly for set theoretic questions like the continuum hypothesis, but also for results in algebra, analysis, infinite combinatorics and more. Machine learning, so far, has escaped this fate." but the description of the EMX learnability makes it more like a classical mathematical / theoretical computer science problem rather than machine learning. - -An insightful conclusion: "How come learnability can neither be proved nor refuted? A closer look reveals that the source of the problem is in defining learnability as the existence of a learning function rather than the existence of a learning algorithm. In contrast with the existence of algorithms, the existence of functions over infinite domains is a (logically) subtle issue." - -In relation to practical problems, it uses an example of ad targeting. However, A lot is lost in translation from the main theorem to this ad example. - -The EMX problem states: given a domain X, a distribution P over X which is unknown, some samples from P, and a family of subsets of X called F, find A in F that approximately maximises P(A). - -The undecidability rests on X being the continuous [0, 1] interval, and from the insight, we know the problem comes from the cardinality of subsets of the [0, 1] interval, which is "logically subtle". - -In the ad problem, the domain X is all potential visitors, which is finite because there are finite number of people in the world. In this case P is a categorical distribution over the 1..n where n is the population of the world. One can have a good estimate of the parameters of a categorical distribution by asking for sufficiently large number of samples and computing the empirical distribution. Let's call the estimated distribution Q. One can choose the from F (also finite) the set that maximises Q(A) which will be a solution to EMX. - -In other words, the theorem states: EMX is undecidable because not all EMX instances are decidable, because there are some nasty ones due to infinities. That does not mean no EMX instance is decidable. And I think the ad instance is decidable. Is there a learning task that actually corresponds to an undecidable EMX instance? I don't know, but I will not believe the result of this paper is useful until I see one. - -h/t Reynaldo Boulogne diff --git a/microposts/margins.md b/microposts/margins.md deleted file mode 100644 index 0eb3306..0000000 --- a/microposts/margins.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-10-05 ---- - -With Fermat's Library's new tool [margins](https://fermatslibrary.com/margins), you can host your own journal club. diff --git a/microposts/math-writing-decoupling.md b/microposts/math-writing-decoupling.md deleted file mode 100644 index e765b71..0000000 --- a/microposts/math-writing-decoupling.md +++ /dev/null @@ -1,10 +0,0 @@ ---- -date: 2018-05-10 ---- -### Writing readable mathematics like writing an operating system - -One way to write readable mathematics is to decouple concepts. One idea is the following template. First write a toy example with all the important components present in this example, then analyse each component individually and elaborate how (perhaps more complex) variations of the component can extend the toy example and induce more complex or powerful versions of the toy example. Through such incremental development, one should be able to arrive at any result in cutting edge research after a pleasant journey. - -It's a bit like the UNIX philosophy, where you have a basic system of modules like IO, memory management, graphics etc, and modify / improve each module individually (H/t [NAND2Tetris](http://nand2tetris.org/)). - -The book [Neutral networks and deep learning](http://neuralnetworksanddeeplearning.com/) by Michael Nielsen is an example of such approach. It begins the journey with a very simple neutral net with one hidden layer, no regularisation, and sigmoid activations. It then analyses each component including cost functions, the back propagation algorithm, the activation functions, regularisation and the overall architecture (from fully connected to CNN) individually and improve the toy example incrementally. Over the course the accuracy of the example of mnist grows incrementally from 95.42% to 99.67%. diff --git a/microposts/neural-nets-activation.md b/microposts/neural-nets-activation.md deleted file mode 100644 index a0d7a20..0000000 --- a/microposts/neural-nets-activation.md +++ /dev/null @@ -1,6 +0,0 @@ ---- -date: 2018-05-09 ---- -> What makes the rectified linear activation function better than the sigmoid or tanh functions? At present, we have a poor understanding of the answer to this question. Indeed, rectified linear units have only begun to be widely used in the past few years. The reason for that recent adoption is empirical: a few people tried rectified linear units, often on the basis of hunches or heuristic arguments. They got good results classifying benchmark data sets, and the practice has spread. In an ideal world we'd have a theory telling us which activation function to pick for which application. But at present we're a long way from such a world. I should not be at all surprised if further major improvements can be obtained by an even better choice of activation function. And I also expect that in coming decades a powerful theory of activation functions will be developed. Today, we still have to rely on poorly understood rules of thumb and experience. - -Michael Nielsen, [Neutral networks and deep learning](http://neuralnetworksanddeeplearning.com/chap6.html#convolutional_neural_networks_in_practice)
\ No newline at end of file diff --git a/microposts/neural-nets-regularization.md b/microposts/neural-nets-regularization.md deleted file mode 100644 index 9f2866d..0000000 --- a/microposts/neural-nets-regularization.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -date: 2018-05-08 ---- -> no-one has yet developed an entirely convincing theoretical explanation for why regularization helps networks generalize. Indeed, researchers continue to write papers where they try different approaches to regularization, compare them to see which works better, and attempt to understand why different approaches work better or worse. And so you can view regularization as something of a kludge. While it often helps, we don't have an entirely satisfactory systematic understanding of what's going on, merely incomplete heuristics and rules of thumb. -> -> There's a deeper set of issues here, issues which go to the heart of science. It's the question of how we generalize. Regularization may give us a computational magic wand that helps our networks generalize better, but it doesn't give us a principled understanding of how generalization works, nor of what the best approach is. - -Michael Nielsen, [Neural networks and deep learning](http://neuralnetworksanddeeplearning.com/chap3.html#why_does_regularization_help_reduce_overfitting) diff --git a/microposts/neural-networks-programming-paradigm.md b/microposts/neural-networks-programming-paradigm.md deleted file mode 100644 index e84b7f2..0000000 --- a/microposts/neural-networks-programming-paradigm.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -date: 2018-05-01 ---- -> Neural networks are one of the most beautiful programming paradigms ever invented. In the conventional approach to programming, we tell the computer what to do, breaking big problems up into many small, precisely defined tasks that the computer can easily perform. By contrast, in a neural network we don't tell the computer how to solve our problem. Instead, it learns from observational data, figuring out its own solution to the problem at hand. - -Michael Nielsen - [What this book (Neural Networks and Deep Learning) is about](http://neuralnetworksanddeeplearning.com/about.html) - -Unrelated to the quote, note that Nielsen's book is licensed under [CC BY-NC](https://creativecommons.org/licenses/by-nc/3.0/deed.en_GB), so one can build on it and redistribute non-commercially. diff --git a/microposts/neural-turing-machine.md b/microposts/neural-turing-machine.md deleted file mode 100644 index 96f422e..0000000 --- a/microposts/neural-turing-machine.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -date: 2018-05-09 ---- -> One way RNNs are currently being used is to connect neural networks more closely to traditional ways of thinking about algorithms, ways of thinking based on concepts such as Turing machines and (conventional) programming languages. [A 2014 paper](https://arxiv.org/abs/1410.4615) developed an RNN which could take as input a character-by-character description of a (very, very simple!) Python program, and use that description to predict the output. Informally, the network is learning to "understand" certain Python programs. [A second paper, also from 2014](https://arxiv.org/abs/1410.5401), used RNNs as a starting point to develop what they called a neural Turing machine (NTM). This is a universal computer whose entire structure can be trained using gradient descent. They trained their NTM to infer algorithms for several simple problems, such as sorting and copying. -> -> As it stands, these are extremely simple toy models. Learning to execute the Python program `print(398345+42598)` doesn't make a network into a full-fledged Python interpreter! It's not clear how much further it will be possible to push the ideas. Still, the results are intriguing. Historically, neural networks have done well at pattern recognition problems where conventional algorithmic approaches have trouble. Vice versa, conventional algorithmic approaches are good at solving problems that neural nets aren't so good at. No-one today implements a web server or a database program using a neural network! It'd be great to develop unified models that integrate the strengths of both neural networks and more traditional approaches to algorithms. RNNs and ideas inspired by RNNs may help us do that. - -Michael Nielsen, [Neural networks and deep learning](http://neuralnetworksanddeeplearning.com/chap6.html#other_approaches_to_deep_neural_nets) diff --git a/microposts/nlp-arxiv.md b/microposts/nlp-arxiv.md deleted file mode 100644 index 2b7e36a..0000000 --- a/microposts/nlp-arxiv.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2018-05-08 ---- -Primer Science is a tool by a startup called Primer that uses NLP to summarize contents (but not single papers, yet) on arxiv. A developer of this tool predicts in [an interview](https://twimlai.com/twiml-talk-136-taming-arxiv-w-natural-language-processing-with-john-bohannon/#) that progress on AI's ability to extract meanings from AI research papers will be the biggest accelerant on AI research.
\ No newline at end of file diff --git a/microposts/open-library.md b/microposts/open-library.md deleted file mode 100644 index c4e92a1..0000000 --- a/microposts/open-library.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2020-06-12 ---- -Open Library was cofounded by Aaron Swartz. As part of the Internet Archive, it has done good work to spread knowledge. However it is currently [being sued by four major publishers](https://arstechnica.com/tech-policy/2020/06/internet-archive-ends-emergency-library-early-to-appease-publishers/) for the [National Emergency Library](https://archive.org/details/nationalemergencylibrary). IA decided to [close the NEL two weeks earlier than planned](https://blog.archive.org/2020/06/10/temporary-national-emergency-library-to-close-2-weeks-early-returning-to-traditional-controlled-digital-lending/), but the lawsuit is not over, which in the worst case scenario has the danger of resulting in Controlled Digital Lending being considered illegal and (less likely) bancruptcy of the Internet Archive. If this happens it will be a big setback of the free-culture movement. diff --git a/microposts/open-review-net.md b/microposts/open-review-net.md deleted file mode 100644 index e219eb7..0000000 --- a/microposts/open-review-net.md +++ /dev/null @@ -1,9 +0,0 @@ ---- -date: 2018-05-14 ---- - -### Open peer review - -Open peer review means peer review process where communications e.g. comments and responses are public. - -Like [SciPost](https://scipost.org/) mentioned in [my post](/posts/2018-04-10-update-open-research.html), [OpenReview.net](https://openreview.net) is an example of open peer review in research. It looks like their focus is machine learning. Their [about page](https://openreview.net/about) states their mission, and here's [an example](https://openreview.net/group?id=ICLR.cc/2018/Conference) where you can click on each entry to see what it is like. We definitely need this in the maths research community. diff --git a/microposts/pun-generator.md b/microposts/pun-generator.md deleted file mode 100644 index baf8dcd..0000000 --- a/microposts/pun-generator.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-06-19 ---- - -[Pun generators exist](https://en.wikipedia.org/wiki/Computational_humor#Pun_generation). diff --git a/microposts/random-forests.md b/microposts/random-forests.md deleted file mode 100644 index 93bc704..0000000 --- a/microposts/random-forests.md +++ /dev/null @@ -1,14 +0,0 @@ ---- -date: 2018-05-15 ---- - -### Notes on random froests - -[Stanford Lagunita's statistical learning course](https://lagunita.stanford.edu/courses/HumanitiesSciences/StatLearning/Winter2016/info) has some excellent lectures on random forests. It starts with explanations of decision trees, followed by bagged trees and random forests, and ends with boosting. From these lectures it seems that: - -1. The term "predictors" in statistical learning = "features" in machine learning. -2. The main idea of random forests of dropping predictors for individual trees and aggregate by majority or average is the same as the idea of dropout in neural networks, where a proportion of neurons in the hidden layers are dropped temporarily during different minibatches of training, effectively averaging over an emsemble of subnetworks. Both tricks are used as regularisations, i.e. to reduce the variance. The only difference is: in random forests, all but a square root number of the total number of features are dropped, whereas the dropout ratio in neural networks is usually a half. - -By the way, here's a comparison between statistical learning and machine learning from the slides of the Statistcal Learning course: - -<a href="../assets/resources/sl-vs-ml.png"><img src="../assets/resources/sl-vs-ml.png" alt="SL vs ML" style="width:38em" /></a> diff --git a/microposts/rnn-fsm.md b/microposts/rnn-fsm.md deleted file mode 100644 index 61b500f..0000000 --- a/microposts/rnn-fsm.md +++ /dev/null @@ -1,14 +0,0 @@ ---- -date: 2018-05-11 ---- -### Some notes on RNN, FSM / FA, TM and UTM - -Related to [a previous micropost](#neural-turing-machine). - -[These slides from Toronto](http://www.cs.toronto.edu/~rgrosse/csc321/lec9.pdf) are a nice introduction to RNN (recurrent neural network) from a computational point of view. It states that RNN can simulate any FSM (finite state machine, a.k.a. finite automata abbr. FA) with a toy example computing the parity of a binary string. - -[Goodfellow et. al.'s book](http://www.deeplearningbook.org/contents/rnn.html) (see page 372 and 374) goes one step further, stating that RNN with a hidden-to-hidden layer can simulate Turing machines, and not only that, but also the *universal* Turing machine abbr. UTM (the book referenced [Siegelmann-Sontag](https://www.sciencedirect.com/science/article/pii/S0022000085710136)), a property not shared by the weaker network where the hidden-to-hidden layer is replaced by an output-to-hidden layer (page 376). - -By the way, the RNN with a hidden-to-hidden layer has the same architecture as the so-called linear dynamical system mentioned in [Hinton's video](https://www.coursera.org/learn/neural-networks/lecture/Fpa7y/modeling-sequences-a-brief-overview). - -From what I have learned, the universality of RNN and feedforward networks are therefore due to different arguments, the former coming from Turing machines and the latter from an analytical view of approximation by step functions. diff --git a/microposts/rnn-turing.md b/microposts/rnn-turing.md deleted file mode 100644 index 5c7605c..0000000 --- a/microposts/rnn-turing.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-09-18 ---- - -Just some non-rigorous guess / thought: Feedforward networks are like combinatorial logic, and recurrent networks are like sequential logic (e.g. data flip-flop is like the feedback connection in RNN). Since NAND + combinatorial logic + sequential logic = von Neumann machine which is an approximation of the Turing machine, it is not surprising that RNN (with feedforward networks) is Turing complete (assuming that neural networks can learn the NAND gate). diff --git a/microposts/sanders-suspend-campaign.md b/microposts/sanders-suspend-campaign.md deleted file mode 100644 index cd3448a..0000000 --- a/microposts/sanders-suspend-campaign.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2020-04-15 ---- -Suspending the campaign is different from dropping out of the race. Bernie Sanders remains on the ballot, and indeed in his campaign suspension speech he encouraged people to continue voting for him in the democratic primaries to push for changes in the convention. diff --git a/microposts/short-science.md b/microposts/short-science.md deleted file mode 100644 index b18a21c..0000000 --- a/microposts/short-science.md +++ /dev/null @@ -1,10 +0,0 @@ ---- -date: 2018-09-05 ---- -> - ShortScience.org is a platform for post-publication discussion aiming to improve accessibility and reproducibility of research ideas. -> - The website has over 800 summaries, mostly in machine learning, written by the community and organized by paper, conference, and year. -> - Reading summaries of papers is useful to obtain the perspective and insight of another reader, why they liked or disliked it, and their attempt to demystify complicated sections. -> - Also, writing summaries is a good exercise to understand the content of a paper because you are forced to challenge your assumptions when explaining it. -> - Finally, you can keep up to date with the flood of research by reading the latest summaries on our Twitter and Facebook pages. - -[ShortScience.org](https://shortscience.org) diff --git a/microposts/simple-solution-lack-of-math-rendering.md b/microposts/simple-solution-lack-of-math-rendering.md deleted file mode 100644 index 84a6f14..0000000 --- a/microposts/simple-solution-lack-of-math-rendering.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-05-02 ---- -### Pastebin for the win -The lack of maths rendering in major online communication platforms like instant messaging, email or Github has been a minor obsession of mine for quite a while, as I saw it as a big factor preventing people from talking more maths online. But today I realised this is totally a non-issue. Just do what people on IRC have been doing since the inception of the universe: use a (latex) pastebin. diff --git a/microposts/sql-injection-video.md b/microposts/sql-injection-video.md deleted file mode 100644 index 76fadb9..0000000 --- a/microposts/sql-injection-video.md +++ /dev/null @@ -1,4 +0,0 @@ ---- -date: 2018-05-08 ---- -Computerphile has some brilliant educational videos on computer science, like [a demo of SQL injection](https://www.youtube.com/watch?v=ciNHn38EyRc), [a toy example of the lambda calculus](https://www.youtube.com/watch?v=eis11j_iGMs), and [explaining the Y combinator](https://www.youtube.com/watch?v=9T8A89jgeTI). diff --git a/microposts/stallman-resign.md b/microposts/stallman-resign.md deleted file mode 100644 index 0460280..0000000 --- a/microposts/stallman-resign.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -date: 2019-09-29 ---- -Last week Richard Stallman resigned from FSF. It is a great loss for the free software movement. - -The apparent cause of his resignation and the events that triggered it reflect some alarming trends of the zeitgeist. Here is a detailed review of what happened: [Low grade "journalists" and internet mob attack RMS with lies. In-depth review.](https://sterling-archermedes.github.io/). Some interesting articles on this are: [Weekly Roundup: The Passion Of Saint iGNUcius Edition](https://jackbaruth.com/?p=16779), [Why I Once Called for Richard Stallman to Step Down](http://techrights.org/2019/09/17/rms-witch-hunt/). - -Dishonest and misleading media pieces involved in this incident include [The Daily Beast](https://www.thedailybeast.com/famed-mit-computer-scientist-richard-stallman-defends-epstein-victims-were-entirely-willing), [Vice](https://www.vice.com/en_us/article/9ke3ke/famed-computer-scientist-richard-stallman-described-epstein-victims-as-entirely-willing), [Tech Crunch](https://techcrunch.com/2019/09/16/computer-scientist-richard-stallman-who-defended-jeffrey-epstein-resigns-from-mit-csail-and-the-free-software-foundation/), [Wired](https://www.wired.com/story/richard-stallmans-exit-heralds-a-new-era-in-tech/). diff --git a/microposts/static-site-generator.md b/microposts/static-site-generator.md deleted file mode 100644 index 5804153..0000000 --- a/microposts/static-site-generator.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -date: 2018-03-23 ---- -> "Static site generators seem like music databases, in that everyone eventually writes their own crappy one that just barely scratches the itch they had (and I'm no exception)." - -<a href="https://news.ycombinator.com/item?id=7747651">\_\_david\_\_\@hackernews</a> - -So did I. diff --git a/microposts/zitierkartell.md b/microposts/zitierkartell.md deleted file mode 100644 index 0279121..0000000 --- a/microposts/zitierkartell.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -date: 2018-09-07 ---- - -[Counter strategy against group that repeatedly does strategic self-citations and ignores other relevant research](https://academia.stackexchange.com/questions/116489/counter-strategy-against-group-that-repeatedly-does-strategic-self-citations-and) |