python - How do I scrape an argument of a JavaScript function inside a JavaScript HTML tag?


I want to scrape the arguments of the Dygraph function (mainly the long line of dates), which are the points on the graph. Until now, the other kinds of tags I scraped were gettable with the findAll function; however, it looks like I need to dig deeper than that for this problem.

<script type="text/javascript">
  g = new Dygraph(
    // containing div
    document.getElementById('dailysubscribers'),
    // CSV or path to a CSV file.
    "date,daily subs\n" + "2016-07-31,1\n" + "2016-08-01,1\n" + "2016-08-02,0\n" + "2016-08-03,1\n" + "2016-08-04,0\n" + "2016-08-05,2\n" + "2016-08-06,10\n" + "2016-08-07,5\n" + "2016-08-08,1\n" + "2016-08-09,1\n" + "2016-08-10,2\n" + "2016-08-11,0\n" + "2016-08-12,0\n" + "2016-08-13,0\n" + "2016-08-14,0\n" + "2016-08-15,1\n" + "2016-08-16,1\n" + "2016-08-17,0\n" + "2016-08-18,0\n" + "2016-08-19,1\n" + "2016-08-20,0\n" + "2016-08-21,1\n" + "2016-08-22,0\n" + "2016-08-23,0\n" + "2016-08-24,7\n" + "2016-08-25,2\n" + "2016-08-26,0\n" + "2016-08-27,1\n" + "2016-08-28,1\n" + "2016-08-29,0\n" + "2016-08-30,0\n" + "2016-08-31,0\n" + "2016-09-01,0\n" + "2016-09-02,0\n" + "2016-09-03,0\n" + "2016-09-04,0\n" + "2016-09-05,1\n" + "2016-09-06,0\n" + "2016-09-07,0\n" + "2016-09-08,0\n", {
      title: 'daily subs gained uczx2vmisqqlwzqwgwubfqqa ',
      legend: 'always',
      ylabel: 'daily subs',
      titleHeight: 20,
      labelsDivStyles: {
        'background': 'none',
        'margin-top': '-10px',
        'text-align': 'right',
      },
      strokeWidth: 1,
      colors: ["#dd2323", "#dd2323", "#dd2323", "#dd2323"],
      labelsKMB: true,
      maxNumberWidth: 10
    }
  );
</script>

Here is a quick way to solve it (brute force, but it works):

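from bs4 import BeautifulSoup

# data is assumed to hold the HTML source of the page as a string
bs = BeautifulSoup(data, 'html.parser')
print(bs)

# Cut out the text between the CSV header and the options object,
# then strip the JavaScript string syntax so only "date,value" pairs remain.
values = (
    str(bs)
    .split('"date,daily subs\\n" +')[1]
    .split(', {')[0]
    .replace('\\n" + "', " ")
    .replace('\\n', " ")
    .replace("\"", "")
    .split(" ")
)[1:-1]
print(values)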

Output:

<script type="text/javascript">
  g = new Dygraph(
    // containing div
    document.getElementById('dailysubscribers'),
    // CSV or path to a CSV file.
    "date,daily subs\n" + "2016-07-31,1\n" + "2016-08-01,1\n" + "2016-08-02,0\n" + "2016-08-03,1\n" + "2016-08-04,0\n" + "2016-08-05,2\n" + "2016-08-06,10\n" + "2016-08-07,5\n" + "2016-08-08,1\n" + "2016-08-09,1\n" + "2016-08-10,2\n" + "2016-08-11,0\n" + "2016-08-12,0\n" + "2016-08-13,0\n" + "2016-08-14,0\n" + "2016-08-15,1\n" + "2016-08-16,1\n" + "2016-08-17,0\n" + "2016-08-18,0\n" + "2016-08-19,1\n" + "2016-08-20,0\n" + "2016-08-21,1\n" + "2016-08-22,0\n" + "2016-08-23,0\n" + "2016-08-24,7\n" + "2016-08-25,2\n" + "2016-08-26,0\n" + "2016-08-27,1\n" + "2016-08-28,1\n" + "2016-08-29,0\n" + "2016-08-30,0\n" + "2016-08-31,0\n" + "2016-09-01,0\n" + "2016-09-02,0\n" + "2016-09-03,0\n" + "2016-09-04,0\n" + "2016-09-05,1\n" + "2016-09-06,0\n" + "2016-09-07,0\n" + "2016-09-08,0\n", {
      title: 'daily subs gained uczx2vmisqqlwzqwgwubfqqa ',
      legend: 'always',
      ylabel: 'daily subs',
      titleHeight: 20,
      labelsDivStyles: {
        'background': 'none',
        'margin-top': '-10px',
        'text-align': 'right',
      },
      strokeWidth: 1,
      colors: ["#dd2323", "#dd2323", "#dd2323", "#dd2323"],
      labelsKMB: true,
      maxNumberWidth: 10
    }
  );
</script>

['2016-07-31,1', '2016-08-01,1', '2016-08-02,0', '2016-08-03,1', '2016-08-04,0', '2016-08-05,2', '2016-08-06,10', '2016-08-07,5', '2016-08-08,1', '2016-08-09,1', '2016-08-10,2', '2016-08-11,0', '2016-08-12,0', '2016-08-13,0', '2016-08-14,0', '2016-08-15,1', '2016-08-16,1', '2016-08-17,0', '2016-08-18,0', '2016-08-19,1', '2016-08-20,0', '2016-08-21,1', '2016-08-22,0', '2016-08-23,0', '2016-08-24,7', '2016-08-25,2', '2016-08-26,0', '2016-08-27,1', '2016-08-28,1', '2016-08-29,0', '2016-08-30,0', '2016-08-31,0', '2016-09-01,0', '2016-09-02,0', '2016-09-03,0', '2016-09-04,0', '2016-09-05,1', '2016-09-06,0', '2016-09-07,0', '2016-09-08,0']
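
A chain of split and replace calls depends on the exact spacing of the script text, so a regular expression may be a more forgiving alternative. The following is only a minimal sketch, not part of the original answer: the names ROW_PATTERN and extract_points are hypothetical, and it assumes every data row looks like "YYYY-MM-DD,N\n" inside a JavaScript string literal. It takes the script body as plain text; with BeautifulSoup that text could come from the .string of each tag returned by find_all('script').

```python
import re

# Matches one quoted CSV row such as "2016-07-31,1\n" in the JavaScript source.
# The \\n in the pattern matches the two literal characters \ and n, not a newline.
# ROW_PATTERN and extract_points are hypothetical names used for illustration.
ROW_PATTERN = re.compile(r'"(\d{4}-\d{2}-\d{2}),(\d+)\\n"')


def extract_points(script_text):
    """Return (date, count) pairs found in the Dygraph CSV argument."""
    return [(date, int(count)) for date, count in ROW_PATTERN.findall(script_text)]


# Shortened excerpt of the script body from the question (raw string, so \n stays literal).
sample = r'''g = new Dygraph(document.getElementById('dailysubscribers'),
"date,daily subs\n" + "2016-07-31,1\n" + "2016-08-01,1\n" + "2016-08-06,10\n", { title: 'x' });'''

points = extract_points(sample)
print(points)
```

The header string "date,daily subs\n" is skipped automatically because it does not start with a date, and the pattern does not care whether the pieces are joined on one line or spread over several.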
