CZ:Proposals/Create a page for all notable genes in the human genome: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Andrew Su
m (Text replacement - "CZ:Workgroup Weeks" to "Archive:Workgroup Weeks")
 
(7 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{proposal assignment|Edit}}
<div class="boilerplate metadata" style="background-color: #dedaca; margin: 2em 0 0 0; padding: 0 10px 0 10px; border: 1px solid #AAAAAA;">
 
:''I suppose it's time that this proposal be official withdrawn.  Archiving the text below in case the proposal ever gets resurrected in the future. [[User:Andrew Su|Andrew Su]] 20:10, 14 July 2008 (CDT) ''
 
 
{{proposal assignment|Dless}}
At first sight, this seems to be something that the relevant workgroups (Biology and Health Sciences) can decide by themselves. However, the proposal may easily create a precedent with wide-ranging implications, for instance on what type of stubs are acceptable, and whether we want a bot write a large number of articles. For that reason I think it's best that the full Editorial Council decides.
At first sight, this seems to be something that the relevant workgroups (Biology and Health Sciences) can decide by themselves. However, the proposal may easily create a precedent with wide-ranging implications, for instance on what type of stubs are acceptable, and whether we want a bot write a large number of articles. For that reason I think it's best that the full Editorial Council decides.




'''Driver:''' [[User:Andrew Su|Andrew Su]]
'''Driver:''' None


== Complete explanation ==
== Complete explanation ==
Line 94: Line 99:
If you are relying on the enthusiasm of the current Biology Workgroup, then our answer is obviously "Nope, we can't do it."  Because obviously the enthusiasm isn't there.  This is just not how self-selecting open-ended collaborative projects work: in such projects, people pursue what interests them, and so it would be an amazing coincidence if among the biologists who happened to show up in CZ there was a strong interest in gene articles.  More to the point, the fact that there is a group and that group members do a remarkable amount of work (over 800 articles in the Biology Workgroup, including 16 approved, 73 developed, and 266 developing) does not entail that they are an ''organized'' and ''assignable'' group.  This is something that a lot of people who look at Wikipedia and CZ and similar projects never perfectly understand: we don't have a staff that we told what to do!
If you are relying on the enthusiasm of the current Biology Workgroup, then our answer is obviously "Nope, we can't do it."  Because obviously the enthusiasm isn't there.  This is just not how self-selecting open-ended collaborative projects work: in such projects, people pursue what interests them, and so it would be an amazing coincidence if among the biologists who happened to show up in CZ there was a strong interest in gene articles.  More to the point, the fact that there is a group and that group members do a remarkable amount of work (over 800 articles in the Biology Workgroup, including 16 approved, 73 developed, and 266 developing) does not entail that they are an ''organized'' and ''assignable'' group.  This is something that a lot of people who look at Wikipedia and CZ and similar projects never perfectly understand: we don't have a staff that we told what to do!


If you want to interest enough people in systematically improving the gene articles, there are two ways to do this, it seems to me.  The first is to wait for a few years when, hopefully, CZ is the main game in town for credible encyclopedic information, and we've got zillions of biologists already involved or willing to get involved.  The second is to recruit geneticists (i.e., people who are willing to work on the gene articles) systematically.  You ''might'' be able to use the [[CZ:Workgroup Weeks|Workgroup Weeks]] framework I've set up (perhaps you already had that thought?) to accomplish that.  We could discuss the details of that.  But I would caution that you can't expect people to jump on the bandwagon even of getting the Workgroup Week started.  As you can see on [[CZ:Workgroup Weeks]], not even many of the most active Citizens are interested in helping.  If you wanted to guarantee that a Gene (or Genetics) Week would happen--in which case I would probably support uploading many gene articles for people to work on--you'd either have to show that there were many people willing to help you out, or you'd have to tell us that you'd do all the work yourself!
If you want to interest enough people in systematically improving the gene articles, there are two ways to do this, it seems to me.  The first is to wait for a few years when, hopefully, CZ is the main game in town for credible encyclopedic information, and we've got zillions of biologists already involved or willing to get involved.  The second is to recruit geneticists (i.e., people who are willing to work on the gene articles) systematically.  You ''might'' be able to use the [[Archive:Workgroup Weeks|Workgroup Weeks]] framework I've set up (perhaps you already had that thought?) to accomplish that.  We could discuss the details of that.  But I would caution that you can't expect people to jump on the bandwagon even of getting the Workgroup Week started.  As you can see on [[Archive:Workgroup Weeks]], not even many of the most active Citizens are interested in helping.  If you wanted to guarantee that a Gene (or Genetics) Week would happen--in which case I would probably support uploading many gene articles for people to work on--you'd either have to show that there were many people willing to help you out, or you'd have to tell us that you'd do all the work yourself!


I don't mean to be discouraging and it looks ungrateful (looking a gift horse in the mouth, as it were), but it all comes down to keeping CZ relatively free of "cruft" that will never be maintained.  I can imagine various ways that the project ''could'' be maintained.  It's just that it would take considerable leadership--and, since no one else is currently in a position to take that on, it would have to be leadership from you, I'm afraid! --[[User:Larry Sanger|Larry Sanger]] 10:48, 19 May 2008 (CDT)
I don't mean to be discouraging and it looks ungrateful (looking a gift horse in the mouth, as it were), but it all comes down to keeping CZ relatively free of "cruft" that will never be maintained.  I can imagine various ways that the project ''could'' be maintained.  It's just that it would take considerable leadership--and, since no one else is currently in a position to take that on, it would have to be leadership from you, I'm afraid! --[[User:Larry Sanger|Larry Sanger]] 10:48, 19 May 2008 (CDT)
Line 146: Line 151:
::Hmm, okay, a bit more poking around helped...  This would be a visual view of available content which would enhance your proposal above for the create-on-demand bot.  (One doesn't seem to depend on the other, right?)  Oh, and fyi, we have something planned that would be have like the [http://diberri.dyndns.org/cgi-bin/templatefiller/index.cgi?ddb=&type=hgnc_id&id=2475 Diberri tool], but that's still quite some ways off.  And, I'm not sure how we'd get it to work with subpages.  Anyway, that I think is also a very good create-on-demand option. [[User:Andrew Su|Andrew Su]] 15:01, 22 May 2008 (CDT)
::Hmm, okay, a bit more poking around helped...  This would be a visual view of available content which would enhance your proposal above for the create-on-demand bot.  (One doesn't seem to depend on the other, right?)  Oh, and fyi, we have something planned that would be have like the [http://diberri.dyndns.org/cgi-bin/templatefiller/index.cgi?ddb=&type=hgnc_id&id=2475 Diberri tool], but that's still quite some ways off.  And, I'm not sure how we'd get it to work with subpages.  Anyway, that I think is also a very good create-on-demand option. [[User:Andrew Su|Andrew Su]] 15:01, 22 May 2008 (CDT)
:::I am basically quite sympathetic to bot-assisted fact-picking from any suitable database, and genes could be a good start. However, I agree with David in that bot-assisted content (though it can quickly get to high standards nowadays, as highlighted by diberri, [http://esciencenews.com/ Eureka science news] and other sites) is not necessarily what people look for in an encyclopedia, and I think an overwhelming mass of anything (genes, airplanes or sandpaper types) in the Random Articles might deter more non-specialists than it brings in specialists (not sure how much this feature is used, though, and by whom). Both the on the fly creation and transfer from a different namespace would alleviate this problem. I would be fine either way, and so would the test page. What I imagine in the automated part is that we have a list containing nothing else than <nowiki>{{gla|Gene_001}} ... to {{Gla|Gene_400}}</nowiki> (that's the quick part) to which the template (once functional) could then add contents that the bot has provided ''somewhere on CZ, perhaps not in the main namespace''. This content would include data for the page-to-be plus a definition like "''{{def|APP}}''" for [[APP]]. Once we have agreed on whether to put what bot-assisted information where, the template can be told to gather it there. More documentation on the template is at {{tl|Gla}}. -- [[User:Daniel Mietchen|Daniel Mietchen]] 18:49, 22 May 2008 (CDT)
:::I am basically quite sympathetic to bot-assisted fact-picking from any suitable database, and genes could be a good start. However, I agree with David in that bot-assisted content (though it can quickly get to high standards nowadays, as highlighted by diberri, [http://esciencenews.com/ Eureka science news] and other sites) is not necessarily what people look for in an encyclopedia, and I think an overwhelming mass of anything (genes, airplanes or sandpaper types) in the Random Articles might deter more non-specialists than it brings in specialists (not sure how much this feature is used, though, and by whom). Both the on the fly creation and transfer from a different namespace would alleviate this problem. I would be fine either way, and so would the test page. What I imagine in the automated part is that we have a list containing nothing else than <nowiki>{{gla|Gene_001}} ... to {{Gla|Gene_400}}</nowiki> (that's the quick part) to which the template (once functional) could then add contents that the bot has provided ''somewhere on CZ, perhaps not in the main namespace''. This content would include data for the page-to-be plus a definition like "''{{def|APP}}''" for [[APP]]. Once we have agreed on whether to put what bot-assisted information where, the template can be told to gather it there. More documentation on the template is at {{tl|Gla}}. -- [[User:Daniel Mietchen|Daniel Mietchen]] 18:49, 22 May 2008 (CDT)
::::Does anyone know how many people get to CZ via a web search to a specific article page, and how many people go directly to CZ and click the Random Article link?  This seems to be a point of great concern, and I'm curious what the numbers say.  [[User:Andrew Su|Andrew Su]] 19:16, 22 May 2008 (CDT)


== What distinguishes genes from all other categories ==
== What distinguishes genes from all other categories ==
Line 153: Line 160:


:On slightly tangential note, I'm happy that there is some activity here again.  But if this is going to take an EC resolution to resolve this one way or another, it would help if there were actually EC members here to discuss and ask questions.  Then, "regular folks" like us can state our cases one way or another so the EC can make an informed decision.  So any EC members watching?  If not, then I think this proposal is destined to die by neglect...  (And scanning through the recent EC activity, gotta say, if EC has time to vote on a systematic recipe effort and not one on genes/proteins, yikes...)  Cheers, [[User:Andrew Su|Andrew Su]] 19:06, 22 May 2008 (CDT)  Apologies, just noticed that both David and Chris are EC members.  I withdraw my plea for greater EC involvement, and I'll leave it up to you guys when/if to formally propose it...  [[User:Andrew Su|Andrew Su]] 19:12, 22 May 2008 (CDT)
:On slightly tangential note, I'm happy that there is some activity here again.  But if this is going to take an EC resolution to resolve this one way or another, it would help if there were actually EC members here to discuss and ask questions.  Then, "regular folks" like us can state our cases one way or another so the EC can make an informed decision.  So any EC members watching?  If not, then I think this proposal is destined to die by neglect...  (And scanning through the recent EC activity, gotta say, if EC has time to vote on a systematic recipe effort and not one on genes/proteins, yikes...)  Cheers, [[User:Andrew Su|Andrew Su]] 19:06, 22 May 2008 (CDT)  Apologies, just noticed that both David and Chris are EC members.  I withdraw my plea for greater EC involvement, and I'll leave it up to you guys when/if to formally propose it...  [[User:Andrew Su|Andrew Su]] 19:12, 22 May 2008 (CDT)
::Andrew, what I am asking is why should we devote progammer/developer time for this initiative vs others.  I think if we accept this proposal, there is no justification to deny all of the other lists.  This would seem to entail, at least initially, the create of a new name space so as to drop our average word count to 30.  There are so many data bases available these days.  As for the drug articles, I am actually doing them one at a time, up to perhaps 120 or more so far, with structures, etc, but there are at least 4000 drugs out there now.  So my arguments so far to date have basically been "why this subject".  Is it more worthy than others.  Are you going to do the programming?
::Could we do 100 at a time and then have you develope each one, then do the next 100? [[User:David E. Volk|David E. Volk]] 22:02, 22 May 2008 (CDT)
:::Nope, we can do bot programming, but not MW programming.  If the EC feels like the namespace change is necessary, then I agree it significantly detracts from the desirability of this proposal on both ends.  As to the question "why this subject", again, it's a subject that I'm interested in and that I am willing to devote time and resources to.  As to the proposal of doing 100 at a time, I'd suggest the minimum is 1000 to make it worth everyone's effort.  Again, the point is not that ''I'm'' going to develop these articles, but that these are stubs to draw new editors in.  If the EC feels that this effort is not likely to bring new editors in, or that it's not worth having the "cruft" laying around, then that would be a good reason to pass on the proposal.  [[User:Andrew Su|Andrew Su]] 23:31, 22 May 2008 (CDT)
::::To get an idea what kind of information could be similarly harvested by article creation bots (and on what scale), [http://en.wikipedia.org/wiki/Wikipedia:Size_comparisons#Size_of_other_information_collections WP's size comparison of information collections on the web] might be of interest. -- [[User:Daniel Mietchen|Daniel Mietchen]] 04:52, 23 May 2008 (CDT)


==Straw poll==
==Straw poll==
Line 164: Line 178:


{{Proposals navigation}}
{{Proposals navigation}}
</div>

Latest revision as of 02:45, 8 March 2024