Info Beasiswa 2015 : PhD position on semantic hashing

20.4.15 |


=====================
 PhD thesis position
=====================


Summary :
---------
3-years position in Computer Science
Laboratoire Hubert Curien, UMR CNRS 5516, Saint-Etienne, France.
Apply before 7th may 2015 to christophe.gravier@univ-st-etienne.fr
Start your thesis between september and october 2015.


For more information, contact Christophe Gravier <christophe.gravier@univ-st-etienne.fr> or Julien Subercaze <julien.subercaze@univ-st-etienne.fr>




Description :
-------------

On the Web and in social networks, textual similarity search is a problem of the utmost practical interest.
Devising metrics for textual contents (Web pages, tweets, ..) serves the purpose of today Web services â?" search engine, recommender system, online advertising, ... to name a few. 


In this context, one of the popular strategy to overcome scalability issues is Semantic Hashing, which has been proposed in the early 2000 [2, 4]. Semantic hashing aims at embedding the data points from high dimensional spaces of traditional approaches into a Hamming hypercube of size n. In semantic hashing, each data point is associated to a binary code of length n, so that their Hamming distance is similar to the one in the original space. The problem to find how to perform such an embedding provides a hot topic for the computer scientists. Many semantic hashing schemes rely on machine learning: using a corpus, binary classifiers are trained in order to satisfy different optimisation functions [6, 12, 13, 11, 3, 8, 5].


The main issues with what is commonly accepted are the following:
- data dependency : the entire corpus needs to be known in advance â?" or a sufficient portion of it making the approach also subject to cold start.
- concept-insensitive : as these processes relies on keyword feature spaces, two documents semantically similar but making use of different terms are not mapped to close binary codes.
- language-dependant : while some of the most recent works consider hashing textual and multimedia items together, few works focus on multi-lingual corpus [10].


Instead of high dimensional vector spaces, we consider in this thesis other document representation as candidates for hashing. We especially consider graphs as a document representation that can be tuned to be data-independant, concept-sensitive, and language-independant. We aim at exploring further if this representation is more suitable for semantic hashing. This track has been settled in the institute in the last couple of years. Especially, in [9] we demonstrated that a graphical model is a suitable document representation for semantic hashing, as it presents an interesting an massive speed-up for pairwise similarity computation at the cost of a limited loss of semantic similarity with respect to high dimensional models. In [1], we extended our model and we show that an external taxonomy used in the semantic hashing scheme provide concept-sensitivity to our semantic hashing process. The candidates are strongly advised to read these two publications from the team.


In this thesis, we aim at improving this method and evaluate what are the performances and limitations of a semantic hashing scheme based on a graphical representation of documents.
We will put focus on under-studied applications of semantic hashing : multi-lingual semantic hashing and speed up of natural language processing tasks.


Supervisors : Christophe Gravier and Julien Subercaze




Requirements:
-------------


The candidate MUST have :
1. a Master degree in Computer Science,
2. a good mathematical background,
3. a strong background in programming, Java experience is a plus,
4. excellent english writing skills.


In addition, although an obvious and very useful quality in general, it is not mandatory to speak French to apply.




How to apply:
--------------


* Agenda


Submission deadline : May 7th 2015 11:59pm Paris time.
Interview, if selected : between 11-13 May 2015.
Notification : between 10-15 June


* Application file


Your application file MUST contain the following items :
1. A curriculum vitae,
2. The master diploma as a PDF file,
3. The details of your marks for the two last years of the Master,
4. Contact details of two associate professors or professors you had interacted with as reference.


Your application file MAY also contain any information or link you think is appropriate (e. g. scientific publication you have contributed to, online service you maintain, link to your social coding account, ...).


You must send your application as a single, zipped, PDF file to : christophe.gravier@univ-st-etienne.fr




* Evaluation of applications


All applications will be reviewed by all the members of the Knowledge and Representation project, a subdivision of the Connected Intelligence group at Hubert Curien laboratory.
A first selection will be made from your application folder.
If selected, you will enter a second phase of selection based on three exerices :
- A motivation interview so that we can know each others. The interview includes the advisors, but also the other researchers from the laboratory with a more remote perspective on the research,
- A technical assignment : a basic 2-hours Java (or C++) programming task.
- An english assignment.


The entire process may not exceed half a day and are all held the same day.
On that day, you are invited to come to the laboratory.




Location :
----------


Saint-Etienne is located about 2 hours from the mediterranean sea and 2 hours from Alps slopes. Lyon city is at 50 km. Saint Etienne has about 180.000 inhabitants including more than 20.000 students. Surrounded by hills where hiking and mountain biking are significant, Saint Etienne is also member of the Unesco Creative Cities Network for design.




References :
------------


[1] Bamba, P., Subercaze, J., Gravier, C., Benmira, N., and Fontaine, J. (October, 30th 2012). The Twitaholic Next Door. In Proc. of 21st ACM International Conference on Information and Knowledge Management (CIKMâ?T12), pages 2275â?"2278, Maui, Hawaiâ?Ti, USA. ACM.
[2] Gionis, A., Indyk, P., Motwani, R., et al. (1999). Similarity search in high dimensions via hashing. In VLDB, volume 99, pages 518â?"529.
[3] Gu, X., Zhang, Y., Zhang, L., Zhang, D., and Li, J. (2013). An improved method of locality sensitive hashing for indexing large-scale and high-dimensional features. Signal Processing, 93(8):2244â?"2255.
[4] Indyk, P. and Motwani, R. (1998). Approximate nearest neighbors: towards removing the curse of dimensionality. In Proceedings of the thirtieth annual ACM symposium on Theory of computing, pages 604â?"613. ACM.
[5] Ji, J., Li, J., Yan, S., Zhang, B., and Tian, Q. (2012). Super-bit locality-sensitive hashing. In NIPS, pages 108â?"116.

[6] Lin, R.-S., Ross, D. A., and Yagnik, J. (2010). Spec hashing: Similarity preserving algorithm for entropy-based coding. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 848â?"854. IEEE.
[7] Matousek, J. (2013). Lecture notes on metric embeddings. In Department of Applied Mathematics, Czech Republic, and Institute of theoretical Computer Science Zurich, page 126 pages.
[8] Shrivastava, A. and Li, P. (2014). Densifying one permutation hashing via rotation for fast near neighbor search. In Proceedings of The 31st International Conference on Machine Learning, pages 557â?"565.
[9] Subercaze, J., Gravier, C., and Laforest, F. (2013). Towards an expressive and scalable twitterâ?Ts users profiles. In Proceeding of Web Intelligence, pages 101â?"108.
[10] Ture, F., Elsayed, T., and Lin, J. (2011). No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, pages 943â?"952. ACM.
[11] Wang, Q., Si, L., Zhang, Z., and Zhang, N. (2014). Active hashing with joint data example and tag selection. In Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, pages 405â?"414. ACM.
[12] Zhang, D., Wang, J., Cai, D., and Lu, J. (2010). Self-taught hashing for fast similarity search. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 18â?"25. ACM.
[13] Zhang, L., Zhang, Y., Tang, J., Gu, X., Li, J., and Tian, Q. (2013). Topology preserving hashing for similarity search. In Proceedings of the 21st ACM international conference on Multimedia, pages 123â?"132. ACM.
_______________________________________________

__._,_.___

Posted by: Tri Kurniawan Wijaya <trikurniawanwijaya@yahoo.com>

Info Beasiswa 2015 : PhD position at the Max Planck Graduate Center in Mainz, Germany

|

Dear all,

FYI.
 
Best regards,
Iwan H. Sahputra



PhD position at the Max Planck Graduate Center in Mainz, Germany

Metal-organic framework growth: Mechanistic insight for tailored device design

Job offer from March 17, 2015
Metal-organic frameworks (MOFs) are increasingly attracting attention as versatile functional materials for sensing, catalysis, optoelectronics and energy storage/conversion applications. Particularly attractive is the possibility of strategic combination of metallic and organic building blocks to create custom-tailored MOF materials with desired structure and functionality. In order to achieve such systematic MOF device design, fundamental understanding of the nucleation phase and early growth stages that govern the final material properties is urgently required.
The overall aim of the proposed PhD project is to derive a theoretical model for electrochemically controlled MOF growth that relates relevant tunable synthesis parameters, such as initial structure and composition, to the resulting MOF 2D/3D structure and chemical functionality.
More specific, we will investigate how metal/organic-linker/dopant concentrations and electrochemical synthesis potential govern the structure and crystallinity as well as the chemical sensing properties of a showcase luminescent Zn-MOF routinely used in advanced sensing applications. With help of such a theoretical model, bottom-up fabrication protocols can be developed that enable rational design of functional MOF materials.
The project will involve the advance of current computational electrochemistry techniques on a multiscale approach where small size electronic structure calculations will be combined with largescale atomistic simulations including reactive force fields (Jun. Prof. M. Sulpizi, JGU). The simulated models will be tested against experimental data, namely electrochemical vibrational spectroscopy (nearfield and nonlinear Raman, infrared) and complementary in situ scanning probe techniques (STM, AFM) (Dr. K.F. Domke, MPIP). The powerful combination of theoretical and experimental insight will provide a unique base for the development of strategic preparation protocols for MOFs with tailored architectures and functionalities.

Who can apply?
Mandatory requirements are: - master in Physics, Chemistry or Materials Science - good knowledge of Statistical Mechanics or Electronic Structure Theory; good knowledge of modern programming and scripting languages; proficiency in spoken and written English language. 
Application process
Applications including a statement of interest, curriculum vitae and name and address of at least one reference person, should be sent to sulpizi@uni-mainz.de.

__._,_.___

Posted by: Iwan Halim Sahputra <halimits@yahoo.com>

Info Beasiswa 2015 : PhD Position in Atomistic Mechanics of Glasses, Karlsruhe Institute of Technology

|


Dear all,

FYI.
 
Best regards,
Iwan H. Sahputra

PhD Position in Atomistic Mechanics of Glasses, Karlsruhe Institute of Technology
Friction and wear are important processes that determine the function of many mechanical devices, but their underlying microscopic physical principles are complex and often not well understood. Amorphous – or glassy – materials often form on surfaces that have experienced frictional loading. The research will involve large-scale molecular dynamics simulation of the near-surface deformation upon indentation and scratching of network and bulk metallic glasses. The goal is to build a fundamental understanding of energy dissipation channels and plastic deformation processes in these materials, and to identify potential differences in the behavior of ceramic and metallic glass formers. The project will also make connection with higher level meso- and macroscale theories for the description of the plasticity of glasses.
The research will be carried out in the group of Dr. Lars Pastewka in a project funded by the Emmy-Noether program of the German Research Foundation. The group is embedded in the MicroTribology Center (μTC) which bundles tribology-related research activities at KIT-IAM and Fraunhofer IWM. KIT provides excellent infrastructure for computational research, including multiple high-performance computing platforms and large-data storage facilities.
The candidate has an MSc (or equivalent) with a strong focus on Material Science, Mechanics, Physics or Chemistry. Additional knowledge of glasses, numerical methods, atomic-scale simulation, and experience with Python and C/C++ is an asset.
Remuneration occurs on the basis of the wage agreement of the civil service in TV-L. The application deadline is May 31, 2015.
For more information please contact Dr. Lars Pastewka, Email: lars.pastewka@kit.edu. Interested candidates are asked to send a cover letter, curriculum vitae, transcripts and contact information for at least one academic reference to the IAM-CMS front office reachable at office-CMS@iam.kit.edu.
KIT is an equal opportunity employer. Women are especially encouraged to apply. Applicants with disabilities will be preferentially considered if equally qualified.

Info Beasiswa Terbaru 2015 : Postdoc and Ph.D. positions in theoretical machine learning @ Inria Lille, France.

|

A postdoc and   Ph.D. position is offered at Inria Lille, in collaboration with CWI, Amsterdam.
The successful applicants will work with Daniil Ryabko at Inria, and will collaborate
with Peter Grünwald at CWI, Amsterdam, spending some of  time in Amsterdam.
(For postdoc between 3 and 6 months.)

Application Deadline: 26/04/2015, but better apply earlier
Duration: 36 months for PhD, 16 months for postdoc
Starting date: October, 2015

Please contact Daniil dot Ryabko at inria.fr before applying, with  [phd] in the subject line.

The topic is non-parametric sequential prediction.
The topic belongs to the areas of machine learning and (extremely) nonparametric statistics. The central theme of this topic is to explore which regularities are "learnable" from sequential data.
Specifically, this general question is considered for the problem of probability forecasting, that is, predicting the probabilities of future outcomes of a series of events given the past. The question to be addressed is: under which assumptions on the stochastic mechanism generating the data is it possible to give forecasts whose error becomes negligible as more data becomes available? Here we specifically allow for the possibility that the predictions are based on a model that is `wrong yet useful', i.e. it does not contain the data generating mechanism. In this 'nonrealizable' or 'misspecified' case, the question becomes: under what conditions it is possible to give forecasts that converge to the best available ones as more data becomes available?

Questions of this kind find applications in a variety of fields, such as finance, data compression, bioinformatics, environmental sciences,  and many others. However, the research topic is mainly about theoretical foundations rather than applications.


Background papers: paper1 (Ryabko), paper2 (Ryabko), paper3 (Grünwald/van Ommen)

The successful applicant will have a strong mathematical background with an M.Sc. in mathematics, computer science or statistics.


About Inria and the job

Established in 1967, Inria is the only public research body fully dedicated to computational sciences. Combining computer sciences with mathematics, Inria’s 3,500 researchers
with 350 working at the Inria  centre in Lille.

Lille is only 1h away from Paris, 34min from Brussels and 1h30 from London -  all by train.

Benefits: Possibility of French courses, Help for housing, Financial support from Inria to catering and transportation expenses, Scientific Resident card and help for visa, Catering service

Monthly salary after taxes: around 1580 € the 1st two years and 1660 € the 3rd year (social security included).

About CWI:

CWI is the national research institute for mathematics and computer science in the Netherlands, located in Amsterdam. It conducts pioneering research in these fields and transfers its results to society. With 55 permanent research staff, 40 postdocs and 70 PhD students, CWI is a compact institute that lies at the heart of European research in mathematics and computer science. It was the birthplace of the European internet and was home to the invention of the popular programming language Python. CWI is located within easy biking distance from the centre of one of Europe's most beautiful, lively and international cities.
--
You received this message because you are subscribed to the Google Groups "Machine Learning News" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ml-news+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout

Info Beasiswa 2015 : Internship on data analysis at the Pulse Lab Jakarta

|


Pulse Lab Jakarta is a joint programme between the United Nations and
Government of Indonesia through UN Global Pulse (http://unglobalpulse.org/)
and the Indonesian Ministry of National Development Planning (Bappenas).
Our mission is to harness big data for development and humanitarian action
and an internship at our lab can offer you a distinguished, unique
opportunity to help tackle development challenges with the public sector,
applying your skills around data analytics, modeling and visualization.



As a data innovation lab, the Research team at Pulse Lab Jakarta, together
with our colleagues in Pulse Lab New York and Pulse Lab Kampala, are
exploring various opportunities of applying big data in different sectors,
such as food security, public health, urban dynamics and economic
well-being, with different types of big data, such as social media, online
news, remote sensing, and online search keywords. This year, in particular,
we hope to test other methodologies and approaches by using other data
sources, e.g., financial transactions and mobility data.



We are looking for talented PhD students who have already published papers
at top-tier conferences and journals in a broad area of data science,
including (but not limited to) social media analysis, user behavior
modeling, statistical model, informational retrieval, NLP, predictive
analytics, complex network analysis, geospatial analysis, image processing,
and data visualization.



Starting dates and internship are negotiable depending on the selected
candidates' availability (preferably, two to three months, at least).
Applications for the internship should be received by *April 24th, 2015**. *
Please submit an application with your CV and motivation letter (clearly
highlighting the speciality, previous research, and/or which sector you
want to work in, if any) to Jong Gun Lee, the Data Scientist and Research
Lead at Pulse Lab Jakarta, jonggun.lee@un.or.id. Applications will be
evaluated on a rolling basis. We will invite shortlisted candidates for
interviews.

Beasiswa S1 S2 dan S3 di Turki 2015

12.3.15 |

Applications for Türkiye Scholarships will start by February 25th. The application deadline is 31st March 2015. Applications for undergraduate, master and PhD programmes will received through the same application cycle.

The application to Türkiye Scholarships programmes is online and totally free process.

Candidates can apply only one scholarship programme in accordance with their educational background and academic goals.

Türkiye Scholarships include both scholarship and university placement at the same time. Applicants will be placed in a university and programme among their preferences specified in the online application form.

Most programmes in Turkish universities are instructed in Turkish. However, some departments and universities offer programmes in English or French. The candidates who want to study in these languages need to have an internationally recognized certificate to prove their language proficiency. Also, most of these foreign language programmes may ask international admission test scores such as GRE, GMAT etc. Candidates can check the language of instruction while selecting the programmes.

All Türkiye Scholarships winners who do not have C1 Level Certificate for Turkish Proficiency (including those who are placed into English or French-taught programmes) must attend 1-year Turkish Preparatory Course.

Art and Sport Scholarship Programmes, available for the first time this year, is specifically designed to encourage the skilled international students in the fields of art and sport to study at undergraduate, master or PhD (proficieny in art) degree levels in the prestigious Turkish universities with opportunities of Türkiye Scholarships. Applicants selected to be funded take one-year Turkish Language Course covered by scholarship. Subsequently, they have to receive an acceptance from the relevant faculties of Turkish universities. Applicants are expected to have documents (offical certificate, portfolio, etc.) to prove their success in the field they apply for.

The Scholarship Covers...

• Monthly stipend (600 TL for undergraduate, 850 TL for master and 1200 TL for PhD )
• Full tuition fee
• Free 1-year Turkish language course
• Free accommodation
• Round-trip air ticket
• Health insurance
 
To be eligible for Undergraduate Programmes, applicants must;

• be a citizen of a country other than Turkey (Anyone holding or ever held Turkish citizenship before cannot apply),
• not be a registered student in Turkish universities at the level of study they are applying,
• be have graduated or are likely to graduate from a secondary education institution,
• be under the age of 21 (no earlier than 01.01.1994),
• have at least specified (% 90 for medical sciences, % 70 for other undergraduate programs) cumulative grade point average, diploma grade, national exam or international exam score,
• be in good health
 
To be eligible for Postgraduate Programmes, applicants must;

• be a citizen of a country other than Turkey (Anyone holding or ever held Turkish citizenship before cannot apply)
• not be a registered student in Turkish universities at the level of study they are applying.
• be a bachelor's or master's degree holder by 30th of July 2015 at the latest
• be under the age of 30 for master's programmes (no earlier than 01.01.1985)
• be under the age of 35 for doctorate programmes (no earlier than 01.01.1980)
• have at least 75 % cumulative grade point average or diploma grade over their maximum graduation grade or have at least 75 % success in any accepted national or international graduate admissions test.
• be in good health
 
Required Documents

Undergraduate Level

• Online application
• A copy of a secondary school diploma or document indicating that the candidate is secondary school senior student,
• A certified secondary school transcript (indicating courses taken and relevant grades of the candidate)
• A copy of a valid ID card (passport, national ID, birth certificate etc.)
• Passport photo

Postgraduate Level
 
• Online application
• A copy of a bachelor or master's diploma or document indicating that the candidate is bachelor or master's senior student
• A certified bachelor and/or master's transcript (indicating courses taken and relevant grades of the candidate)
• A copy of a valid ID card (passport, national ID, birth certificate etc.)
• Passport photo

Application Procedure

Candidates are supposed to;

• go to www.turkiyeburslari.gov.tr
• click the online application form,
• create an account by entering an e-mail and password,
• read the application guide,
• log in to the system and fill out the application form,
• upload the required documents,
• and finally finish the application.
 
Click here to reach the Scholarship Programmes brochures and applications
 
Click here to apply

Related Posts Plugin for WordPress, Blogger...
Get Chitika Premium